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(54) Tille: METHODS AND COMPOSITIONS FOR SEAMLESS CLONING OF NUCLEIC ACID MOLECULES 

© 

© (57) Abstract: The present invention is in the fields of biotechnology and molecular biology. More particularly, the present invention 
^ relates to cloning or subcloning one or more nucleic acid molecules comprising one or more type lis restriction enzyme recognition 
Q sites. The present invention also embodies cloning such nucleic acid molecules using recombinational cloning methods such as those 
employing recombination siles and recombination proteins. The present invention also relates to nucleic acid molecules (including 
RNA and iRNA), as well as proteins, expressed from host cells produced using the methods of the present invention. 



METHODS AND COMPOSITIONS FOR SEAMLESS 
CLONING OF NUCLEIC ACID MOLECULES 



Background of the Invention 

Field of the Invention 

[0001] The present invention is in the fields of biotechnology and molecular 

biology. More particularly, the present invention relates to seamlessly cloning 
or subcloning one or more nucleic acid molecules. The present invention also 
relates to seamless cloning of nucleic acid molecules comprising one or more 
type lis restriction enzyme recognition sites. The present invention also 
embodies cloning such nucleic acid molecules using recombinational cloning 
methods such as those employing recombination sites and recombination 
proteins. The present invention also relates to nucleic acid molecules 
(including RNA and iRNA), as well as proteins, expressed from host cells 
produced using the methods of the present invention. 

Related Art 

[0002] A significant problem with many of the currently available molecular 

cloning techniques results from the reliance upon restriction sites. These 
techniques result in the presence of extraneous polynucleotides in the 
amplification products even after restriction digestions. Such extraneous 
polynucleotides can introduce design limitations on the cloned product which 
often interfere with the structure and function of the desired gene products, be 
they RNA, DNA or protein. 

[0003] One method of joining nucleic acids without introducing extraneous 

bases or relying on the presence of restriction sites is splice overlap extension 
(SOE) (Yon et al, Nucl Acids Res. 77:4895 (1989) and Horton et al, Gene 
77:61-6% (1989)). This method is based on the hybridization of homologous 
3' single-stranded overhangs to prime synthesis of DNA using each 
complementary strand as a template. . Although this technique can join 
fragments without introducing extraneous nucleotides (in other words, 
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seamlessly), it does not permit the easy insertion of a DNA segment into a 
specific location when seamless junctions at both ends of the segment are 
required. Nor does this technique allow for joining fragments with a vector. 
Ligation with a vector must be subsequently performed by incorporating 
restriction sites onto the termini of the final SOE fragment. Finally, this 
technique is particularly awkward when trying to exchange polynucleotides 
encoding various domains or mutation sites between genetic constructs 
encoding related proteins. 
[0004] Sorge et al 9 U.S. Patent No. 6,261,797 describe a method by which 

polynucleotide sequences of interest are synthesized using one or more 
synthesis primers, wherein at least one of the primers is a releasable primer. 
After synthesis, the synthesis product is cleaved by a releasing enzyme. The 
releasable primers of Sorge et al comprise a recognition site for a type lis 
restriction endonuclease, principally EamllOSL This then allows for 
"seamless domain replacement" where synthesis reactions allow the 
production of a polynucleotide of interest by synthesizing two different 
polynucleotide sequences using separate sets of primers, cleaving the synthesis 
products with a releasing enzyme, and ligating together the two sets of release 
synthesis products. 

Type lis Restriction Enzymes 

[0005] Restriction enzymes can be grouped based on similar characteristics. 

In general there are three major types or classes: I, II (including lis) and HI. 
Class I enzymes cut at a somewhat random site from the enzyme recognition - 
sites (see Old and Primrose, Principles of Gene Manipulation, Blackwell 
Sciences, Inc., Cambridge, Mass., (1994)). Most enzymes used in molecular 
biology are type H enzymes. These enzymes recognize a particular target 
sequence (i.e., restriction endonuclease recognition site) and break the 
polynucleotide chains within or near to the recognition site. The type n 
recognition sequences are continuous or interrupted. Class Hs enzymes (i.e., 
type Hs enzymes) have asymmetric recognition sequences. Cleavage occurs at 
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a distance from the recognition site. These enzymes have been reviewed by 
Szybalskief a/. Gene 100:13-26 (1991). Class m restriction enzymes are rare 
and are not commonly used in molecular biology. 

[0006] Type-Hs endonucleases generally recognize non-palindromic 

sequences and cleave outside of their recognition site, thus producing 
overhangs of ambiguous base pairs. (Szybalski, Gene 40:169-173 (1985).) 
Additionally, as a result of their non-palindromic recognition sequences, the 
use of type-Hs endonucleases will generate more markers per kB than a 
similar type-H endonuclease, e.g., approximately twice as often. U.S. Patent 
No. 4,293,652 discloses a linker with a type-Hs enzyme recognition sequence 
to permit synthesized DNA to be inserted into a vector without disturbing a 
recognition sequence. Brousseau et al {Gene 1 7:279-289 (1982)) and Urdea 
et al (Proa Natl Acad. Sci. USA 50:7461-7465 (1983)) disclose the use of 
type-IIs enzymes for the production of vectors to produce recombinant insulin 
and epidermal growth factor respectively. , 

[0007] Thus, there remains a need in the art for methods and compositions 

that allow for insertion of nucleic acid molecules into specific locations of 
other nucleic acid molecules with seamless junctions at one or both ends. 
There is also a need in the art for methods and compositions that allow for 
transfer of these seamlessly cloned sections from one nucleic acid molecule to 
another. The present invention fulfills these needs. 

Brief Summary of the Invention 

[0008] The present invention provides methods of seamlessly cloning nucleic 

acid molecules. The seamless cloning methods of the present invention may 
utilize, for example, any restriction enzyme, including those which cleave 
nucleic acid molecules to produce blunt ends. Suitably, the methods of the 
invention utilize type lis restriction sites and enzymes that recognize and 
cleave at such sites, which allow for the insertion of one or more (e.g. one, 
two, three, four, five, etc.) nucleic acid segments into specific locations of a 
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second nucleic acid molecule with seamless junctions on one or both ends. 
The present methods are also suitable for the production of nucleic acid 
molecules (e.g. DNA, RNA, DNA hybrids and the like) that only contain 
nucleic acid sequences that are desired in the product molecule and that lack 
extraneous unwanted sequences, for example sequences comprising or 
encoded by restriction sites. The present invention also provides for protein 
molecules produced or encoded by the cloned nucleic acid molecules of the 
invention, that contain only amino acid sequences that are desired in the 
product protein molecule (e.g., a native or mature protein, a fusion protein, 
and the like), and that lack extraneous amino acids, for example amino acids 
encoded by restriction sites. In certain embodiments, nucleic acid molecules 
of the present invention are especially suitable for use as interfering RNA. 
The present invention also provides novel vectors comprising type lis sites 
and, optionally, selectable markers for the production of seamlessly cloned 
nucleic acids, as well as compositions and kits for practicing methods of the 
invention. 

[0009] In one aspect, the present invention provides methods for joining one 

or more (e.g. one, two, three, four, five, etc.) first nucleic acid molecules and 
one or more second nucleic acid molecules, comprising: (a) combining the 
first and second nucleic acid molecules under conditions sufficient to allow for 
the joining of at least one terminus of the first nucleic acid molecule(s) to at 
least one terminus of the second nucleic acid molecule(s), wherein the 
terminus of the first nucleic acid molecule(s) which is connected to the 
terminus of the second nucleic acid molecule(s) comprises a sticky end (e.g. 
an overhanging end) generated by a restriction enzyme (e.g. a type lis 
restriction enzyme) and the terminus of the second nucleic acid molecule(s) is 
compatible (e.g. a blunt end or a sticky end) with this sticky end. In 
embodiments similar to the above and elsewhere herein, the sticky end may be 
on the terminus of the second nucleic acid molecule, and the first nucleic acid 
molecule may contain the compatible end. 
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[0010] In suitable such embodiments, the present invention provides methods 

of cloning or subcloning one or more desired nucleic acid molecules 
comprising: (a) combining in vitro or in vivo, (i) one or more first nucleic acid 
molecules comprising one or more sticky ends that have been generated by 
one or more restriction enzymes, (e.g. one or more type As restriction 
enzymes); and (ii) one or more second nucleic acid molecules comprising one 
or more ends which are compatible with the one or more sticky ends on the 
first nucleic acid molecule(s) and, optionally, one or more selectable markers; 
and (b) incubating the combination under conditions sufficient to join the first 
nucleic acid molecule and one or more of the second nucleic acid molecules, 
thereby producing one or more desired product nucleic acid molecules. 

[0011] In other aspects, the present invention provides methods for cloning or 

subcloning one or more desired nucleic acid molecules comprising: (a) 
comb inin g in vitro or in vivo, (i) one or more first nucleic acid molecules 
comprising one or more sticky ends that have been generated by one or more 
restriction enzymes (e.g. one or more type lis restriction enzymes); (ii) one or 
more second nucleic acid molecules comprising one or more restriction sites 
{e.g. one or more first type lis restriction enzyme recognition sites) and, 
optionally, one or more selectable markers; and (iii) one or more restriction 
enzymes (e.g., one or more type lis restriction enzymes) that are specific for 
the one or more restriction sites on the second molecules; and (b) incubating 
the combination under conditions sufficient to join the first nucleic acid 
molecule and one or more of the second nucleic acid molecules, thereby 
producing one or more desired product nucleic acid molecules. 

[0012] In additional related aspects, the present invention provides methods 

for cloning or subcloning one or more desired nucleic acid molecules 
comprising: (a) combining in vitro or in vivo, (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment that is flanked by one 
or more restriction sites (e.g. one or more first type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more ends which are compatible with a sticky end on the segment and, 



optionally, one or more selectable markers; and (iii) one or more restriction 
enzymes {e.g., one or more type Us restriction enzymes) that are specific for 
the one or more restriction sites on the at least one nucleic acid segment; and 
(b) incubating the combination under conditions sufficient to join the first 
nucleic acid segment and one or more of the second nucleic acid molecules, 
thereby producing one or more desired product nucleic acid molecules. 

In related aspects, the present invention provides methods for cloning 
or subcloning one or more desired nucleic acid molecules, or portions thereof, 
comprising: (a) combining in vitro or in vivo, (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment that is flanked by one 
or more first restriction sites (e.g. one or more first type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more second restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites) and, optionally, one or more selectable markers; and 
(iii) one or more restriction enzymes (e.g. one or more type lis restriction 
enzymes) that are specific for the first and/or second type lis restriction 
enzyme recognition sites; and (b) incubating the combination under conditions 
sufficient to join the first nucleic acid segment and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. 

Type lis restriction enzyme recognition sites and type lis restriction 
enzymes that are useful in the present cloning methods, compositions, nucleic 
acids, vectors and kits include, but are not limited to, Bsal, Bbsl, BbvE, 
BsmAI, BspMl, Eco3U, BsmBl Bael, Fokl, Hgal, Mly\ SfaNl and Sthl32I. 
The first, and second restriction sites, if present, utilized throughout the 
various aspects of the present invention may be the same or they may be 
different. In addition, the restriction sites on the same nucleic acid molecule 
(and/or nucleic acid segment) may be the same, or they may be different. The 
present invention also encompasses situations wherein one or both of the 
nucleic acid molecules involved in the various methods are vectors, and where 
one or both of the nucleic acid molecules are linear nucleic acid molecules. 
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The present invention also encompasses the use of other blunt-end cleavage 
enzymes, including, but not limited to, Seal, Smal, Hpal, HincR, HaeU and 
AM. 

[0015] In certain embodiments, the nucleic acids and nucleic acid segments 

utilized in the cloning methods, compositions, kits, and vectors of the present 
invention may optionally comprise one or more selectable markers. Hence, 
the invention also provides such nucleic acids. The one or more selectable 
markers utilized in the present invention may be flanked by one or more (e.g. 
one, two, three, four, five, etc.) restriction sites (e.g. type lis restriction 
enzyme recognition sites). Suitable selectable markers include, but are not 
limited to, genes that confer antibiotic resistance, genes that encode 
fluorescent proteins, tRNA genes, auxotrophic markers, toxic genes, 
phenotypic markers, antisense oligonucleotides, restriction endonucleases, 
restriction endonuclease cleavage sites, enzyme cleavage sites, protein binding 
sites, and sequences complementary to PCR primer sequences. Suitable 
antibiotic resistance genes include, but are not limited to, a chloramphenicol 
resistance gene, an ampicillin resistance gene, a tetracycline resistance gene, a 
Zeocin resistance gene, a spectinomycin resistance gene and a kanamycin 
resistance gene. In certain embodiments of the present invention, the 
selectable marker is a toxic gene. Suitable toxic genes include, but are not 
limited to, a ccdB gene, a gene encoding a tus protein which binds one or 
more ter sites, a kicB gene, a sacB gene , an ASK1 gene, a OX174 E gene and 
a Dpnl gene. In additional embodiments of the methods of the present 
invention, the first and/or second nucleic acid molecules may comprise both 
one or more toxic genes and one or more antibiotic resistance genes, and these 
genes may further be flanked by type lis restriction enzyme recognition sites, 
hi suitable such embodiments of the present invention, the first and/or second 
nucleic acid molecules may comprise both a toxic gene and an antibiotic 
resistance gene. 

[0016] In other aspects of the invention, nucleic acids and/or nucleic acid 

segments for use in the cloning methods, vectors, kits and compositions may 
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fiirther comprise one or more recombination sites and/or one or more 
topoisomerase recognition sites and/or one or more topoisomerases. The 
nucleic acids and/or nucleic acid segments of the present invention may also 
comprise two or more recombination sites- If a topoisomerase recognition site 
is present in a nucleic acid molecule or nucleic acid segment of the present 
invention, it may optionally be flanked by two or more recombination sites. 
Recombination sites suitable for use in the present invention include, but are 
not limited to, attB sites, att? sites, atth sites, attR sites, lox sites, psi sites, 
tnpl sites, dif sites, cer sites, frt sites, and mutants, variants and derivatives 
thereof These one or more recombination sites may flank one or more 
selectable markers, if present, and/or restriction sites (e.g. type lis sites). In 
certain embodiments of the present invention, the topoisomerase recognition 
site, if present, is recognized and bound by a type I topoisomerase, which may . 
be a type IB topoisomerase. Suitable types of type IB topoisomerase include, 
but are not limited to, eukaryotic nuclear type I topoisomerase and poxvirus 
topoisomerase. Suitable types of poxvirus topoisomerase include, but are not 
limited to, poxvirus topoisomerase produced by or isolated from a virus such 
as vaccinia virus, Shope fibroma virus, ORF virus, fowlpox virus, molluscum 
contagiosum virus and Amsacta morrei entomopoxvirus. 
[0017] The present invention also provides methods of linking nucleic acid 

molecules and/or nucleic acid segments which comprise one or more 
topoisomerases bound to one or both termini, wherein the topoisomerase 
adapted terminus or termini comprise a sequence compatible with that cleaved 
by a restriction enzyme (e.g. a type lis restriction enzyme). In such suitable 
embodiments of the invention, a first nucleic acid molecule or nucleic acid 
segment may contain a blunt end to be linked, and a second nucleic acid 
molecule may contain an overhang at the end which is to be linked by a site- 
specific topoisomerase (e.g., a type IA or a type IB topoisomerase), wherein 
the overhang includes a sequence complementary to that comprising the blunt 
end, thereby facilitating strand invasion as a means to properly position the 
ends for the linking reaction. 
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[0018] The nucleic acid molecules generated using this aspect of the invention 

include those in which at least one strand (not both strands) is covalently 
linked at the ends which are joined (e.g. double-stranded nucleic acid 
molecules generated using these methods contain a nick at each position 
where two ends were joined). These embodiments are particularly 
advantageous in that a polymerase can be used to replicate the double-stranded 
(ds) nucleic acid molecule by initially replicating the covalently linked strand. 
For example, a thermostable polymerase such as a polymerase useful for 
performing an amplification reaction such as PCR can be used to replicate the 
covalently strand, whereas the strand containing the nick does not provide a 
suitable template for replication. 

[0019] In certain embodiments of the invention, the first or second nucleic 

acid molecules or nucleic acid segments involved in the various methods of 
the present invention may not comprise a promoter. The present invention 
also allows for transfer of a promoter element into a second nucleic acid 
molecule that may not comprise a promoter, via seamless cloning. In this 
orientation, transcription of the second nucleic acid molecule from the 
promoter element located on the first nucleic acid molecule or nucleic acid 
segment may proceed such that no additional sequences are transcribed 
between the promoter element and the transcription initiation point of the 
second nucleic acid molecule. The present invention also allows for 
seamlessly adding a first nucleic acid molecule or nucleic acid segment into a 
second nucleic molecule that contains a promoter element such that the first 
nucleic acid molecule or segment will subsequently be under the control of the 
promoter element. 

[0020] The present invention also provides methods for cloning or subcloning 

one or more desired nucleic acids: (a) combining in vitro or in vivo, (i) one or 
more first nucleic acid molecules that have one or more sticky ends that have 
been generated by one or more restriction enzymes (e.g. type lis restriction 
enzymes); and (ii) one or more second nucleic acid molecules comprising one 
or more ends which are compatible with the one or more sticky ends on the 
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first nucleic acid molecule(s) and ftirther comprising one or more 
recombination sites; and (b) incubating the combination under conditions 
sufficient to join the first nucleic acid molecule and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. 

[0021] The present invention also provides methods for cloning or subcloning 

one or more desired nucleic acid molecules, or portions thereof, comprising: 
(a) combining in vitro or in vivo, (i) one or more first nucleic acid molecules 
comprising at least one nucleic acid segment that is flanked by one or more 
first restriction sites (e.g. one or more type lis restriction enzyme recognition 
sites); (ii) one or more second nucleic acid molecules comprising one or more 
second restriction sites {e.g. type lis restriction enzyme recogjiition sites) 
flanked by one or more recombination sites; and (iii) one or more restriction 
enzymes (e.g. one or more type lis restriction enzymes) that are specific for 
the first and/or second restriction sites; and (b) incubating the combination 
under conditions sufficient to join the first nucleic acid molecule and one or 
more of the second nucleic acid molecules, thereby producing one or more 
desired product nucleic acid molecules. 

[0022] As described above, the first and/or second nucleic acid molecules 

and/or nucleic acid segments involved in such embodiments of the present 
invention may optionally comprise one or more selectable markers. The first 
and/or second nucleic acid molecules and/or nucleic acid segments involved in 
such aspects of the invention may also, or alternatively comprise one or more 
topoisomerase recognition sites or topoisomerases as described above, and 
optionally or alternatively, two or more recombination sites, which in certain 
such embodiments may flank these topoisomerases or topoisomerase 
recognition sites. 

[0023] The present invention also provides methods for cloning or subcloning 

one or more desired nucleic acid molecules, or portions thereof, via 
recombination cloning comprising: (a) combining, in vitro or in vivo (i) one or 
more first nucleic acid molecules comprising at least one nucleic acid segment 
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that is flanked by one or more restriction sites (e.g. one' or more type lis 
restriction enzyme recognition sites) and that is further flanked by one or more 
recombination sites; (ii) one or more second nucleic acid molecules 
comprising one or more recombination sites; and (iii) one or more site-specific 
recombination proteins; and (b) incubating the combination under conditions 
sufficient to join the first nucleic acid molecule and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. 

[0024] The second nucleic acid molecule involved in such embodiments of the 

invention may also comprise one or more restriction sites (e.g. one or more 
type lis restriction enzyme recognition sites). The first and/or second nucleic 
acids and/or nucleic acid segments involved may also optionally comprise one 
or more selectable markers as described above. The first and/or second 
nucleic acid molecules and/or nucleic acid segments involved in this aspect of 
the invention may also comprise topoisomerase recognition sites or 
topoisomerases as described above, as well as two or more recombination sites 
flanking these topoisomerase sites. 

[0025] Suitable recombination proteins for use in the present invention 

include, but are not limited to, Int, Cre, IHF, Xis, Fis, Hin, Gin, Cin, Tn3 
resolvase, TndX, XerC and XerD. 

[0026] The present invention also provides methods for producing host cells 

comprising one or more of the nucleic acid molecules produced by the cloning 
methods of the present invention Suitable host cells that may be used 
throughout the present invention include, but are not limited to, bacterial cells, 
yeast cells, plant cells and animal cells. The present invention also provides 
methods for producing a subsequent nucleic acid molecule and/or protein by 
expression of the product nucleic acid molecule of the cloning methods of the 
present invention in a host cell. 

[0027] Additional embodiments provide for nucleic acid molecules and 

proteins produced in and isolated from a host cell. In certain such 
embodiments, the nucleic acid molecules produced in the host cell may 
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contain only desired nucleic acid sequences, Le. they may not contain 
extraneous nucleotides, for example, nucleotides encoded by the restriction 
sites (e.g. type lis restriction enzyme recognition sites). Similarly, the proteins 
produced from a host cell by these methods may only contain amino acid 
sequences that correspond to the desired native or mature protein, and may not 
contain extraneous amino acids, for example amino acids encoded by the 
restriction sites (e.g. type lis restriction enzyme recognition sites). Nucleic 
acid molecules produced from a host cell by methods of the present invention 
may be useful as interfering RNA molecules. 
[0028] Another aspect of the present invention provides methods of producing 

an RNA molecule for use as an interfering RNA comprising: (a) optionally, 
identifying one or more target nucleic acid sequences; (b) preparing one or 
more nucleic acid molecules which encode one or more interfering RNAs, 
wherein the interfering RNAs bind to the one or more target nucleic acid 
sequences; (c) combining in vitro or in vivo, (i) the one or more first nucleic 
acid molecules encoding one or more interfering RNAs that have one or more 
sticky ends that have been generated by one or more restriction enzymes (e.g. 
type lis restriction enzymes); and (ii) one or more second nucleic acid 
molecules comprising one or more ends which are compatible with the one or 
more sticky ends on the first nucleic acid molecule(s), and optionally 
comprising one or more selectable markers; and (d) incubating the 
combination under conditions sufficient to join one or more of the nucleic acid 
molecules encoding the interfering RNAs and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules; (e) inserting the one or more product nucleic acid molecules 
into a host cell; and (f) expressing the one or more interfering RNAs in the 
host cell. As in other embodiments of the invention described herein, the 
second nucleic acid molecule may contain an end which is generated by 
digestion with a type lis restriction enzyme and the first nucleic acid molecule 
may contain a compatible end generated by other means. 
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The present invention also provides methods of producing an RNA 
molecule for use as an interfering RNA comprising: (a) optionally, identifying 
one or more target nucleic acid sequences; (b) preparing one or more nucleic 
acid molecules which encode one or more interfering RNAs, wherein the 
interfering RNAs bind to the one or more target nucleic acid sequences; (c) 
combining in vitro or in vivo, (i) the one or more first nucleic acid molecules 
encoding one or more interfering RNAs flanked by one or more first 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
(ii) one or more second nucleic acid molecules comprising one or more second 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites) 
and optionally comprising one or more selectable markers; and (iii) one or 
more site-specific restriction enzymes (e.g. one or more type lis restriction 
enzymes); and (d) incubating the combination under conditions sufficient to 
join one or more of the nucleic acid molecules encoding the interfering RNAs 
and one or more of the second nucleic acid molecules, thereby producing one 
or more desired product nucleic acid molecules; (e) inserting the one or more 
product nucleic acid molecules into a host cell; and (f) expressing the one or 
more interfering RNAs in the host cell. 

In related embodiments, the present invention provides methods of 
producing an RNA molecule for use as an interfering RNA comprising: (a) 
optionally, identifying one or more target nucleic acid sequences; (b) 
preparing one or mo^e nucleic acid molecules which encode one or more 
interfering RNAs, wherein the interfering RNAs bind to the one or more target 
nucleic acid sequences; (c) combining in vitro or in vivo, (i) the one or more 
first nucleic acid ''molecules encoding one or more interfering RNAs that have 
one or more sticky ends that have been generated by one or more restriction 
enzymes (e.g. type lis restriction enzymes); and (ii) one or more second 
nucleic acid molecules comprising one or more ends which are compatible 
with the one or more sticky ends on the first nucleic acid molecule(s), and 
optionally comprising one or more selectable markers; and (d) incubating the 
combination under conditions sufficient to join one or more of the nucleic acid 
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molecules encoding the interfering RNAs and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules; and (e) expressing one or more interfering RNAs in vitro or in 
vivo. In a first further embodiment, the one or more interfering RNAs may be 
produced in vitro or isolaged from a cell and then introduced into a second 
cell. 

[0031] Another aspect of the present invention provides methods of producing 

an RNA molecule for use as an interfering RNA comprising: (a) optionally, 
identifying one or more target nucleic acid sequences; (b) preparing one or 
more nucleic acid molecules which encode one or more interfering RNAs, 
wherein the interfering RNAs bind to the one or more target nucleic acid 
sequences; (c) combining in vitro or in vivo, (i) the one or more first nucleic 
acid molecules encoding one or more interfering RNAs flanked by one or 
more first restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more second restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites) and optionally comprising one or more selectable 
markers; and (iii) one or more site-specific restriction enzymes (e.g. one or 
more type lis restriction enzymes); and (d) incubating the combination under 
conditions sufficient to join one or more of the nucleic acid molecules 
encoding the interfering RNAs and one or more of the second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules; and (e) expressing one or more interfering RNAs in vitro or in 
vivo. In a first further embodiment, the one or more interfering RNAs may be 
produced in vitro or isolaged from a cell and then introduced into a second 
cell. 

[0032] In a related aspect, the present invention provides methods of 

producing an RNA molecule for use as an interfering RNA comprising: (a) 
optionally, identifying one or more target nucleic acid sequences; (b) 
preparing one or more interfering RNAs, wherein the interfering RNAs bind to 
the one or more target nucleic acid sequences; (c) combining in vitro or in 
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vivo % (i) the one or more first nucleic acid molecules comprising one or more 
interfering RNAs that have one or more sticky ends that have been generated 
by one or more restriction enzymes {e.g. type lis restriction enzymes); and (ii) 
one or more second nucleic acid molecules comprising one or more ends 
which are compatible with the one or more sticky ends on the first nucleic acid 
molecule(s), and optionally comprising one or more selectable markers; and 
(d) incubating the combination under conditions sufficient to join one or more 
interfering RNAs and one or more of the second nucleic acid molecules, 
thereby producing one or more desired product nucleic acid molecules; (e) 
inserting the one or more product nucleic acid molecules into a host cell; and 
(f) expressing the one or more interfering RNAs in the host cell. 
[0033] The present invention also provides methods of producing an RNA 

molecule for use as an interfering RNA comprising: (a) optionally, identifying 
one or more target nucleic acid sequences; (b) preparing one or more nucleic 
acid molecules which comprise one or more interfering RNAs, wherein the 
interfering RNAs bind to the one or more target nucleic acid sequences; (c) 
combining in vitro or in vivo, (i) the one or more first nucleic acid molecules 
comprising one or more interfering RNAs flanked by one or more first 
restriction sites {e.g. one or more type lis restriction enzyme recognition sites); 
(ii) one or more second nucleic acid molecules comprising one or more second 
restriction sites {e.g. one or more type lis restriction enzyme recognition sites) 
and optionally comprising one or more selectable markers; and (iii) one or 
more site-specific restriction enzymes {e.g. one or more type lis restriction 
enzymes); and (d) incubating the combination under conditions sufficient to 
join one or more interfering RNAs and one or more of the second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules; (e) inserting the one or more product nucleic acid molecules into a 
host cell; and (f) expressing the one or more interfering RNAs in the host cell. 
[0034] Methods of the present invention may be used, for example, to prepare 

shRNA molecules in which the 5' and 3' termini contain none or few {e.g., 
one, two, three, four, or five) nucleotides which are not encoded by a first 
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nucleic acid molecule referred to throughout Thus, the shRNA may comprise 
from about 40 to about 60 nucleotides in which either none of all but a few 
nucleotides at one or both termini are encoded by a first nucleic acid molecule. 
In such instances j the first nucleic acid molecule may be composed of nucleic 
acid which upon transcription results in the production of RNA with three 
different segments: (1) sense RNA, (2) a loop/non-complementary RNA, and 
(3) antisense RNA. Methods of the invention include introducing into a cell 
(1) (a) nucleic acid which encodes the RNA described above or (b) the RNA 
itself, and (2) the measurement of inhibition of expression of a gene 
corresponding to the sense and/or antisense RNA. 
[0035] In particular embodiments of the invention, the invention may be used 

to produce nucleic acid molecules which produce RNA molecules that do not 
form haiipins. As one example, methods of the invention may be used to 
produce two separate vectors* one or which may be used to produce a sense 
RNA molecules (e.g., a sense RNA molecule which is between about 18 and 
about 30, between about 20 and about 30, between about 22 and about 30, or 
between about 18 and about 25 nucleotides in length) and an antisense RNA 
molecules (e.g. 9 a sense RNA molecule which is between about 18 and about 
30, between about 20 and about 30, between about 22 and about 30, between 
about 18 and about 100, or between about 18 and about 25 nucleotides in 
length), wherein the two RNA molecules are capable of hybridizing to each 
other and/or share a region of sequence complementarity over at least 80%, 
90%, or 95% of their full lengths (e.g., sequence complementarity over a 19 
nucleotide stretch, wherein each molecule is 22 nucleotides in length). 
Alternatively, both sense and antisense RNA molecules, such as described 
above, may be produced by a single vector but as separate transcription 
products. 

[0036] As a variation of the above, the invention may be used to produce 

either sense or antisense RNA molecules alone in cells. These RNA 
molecules may be of any length suitable for the particular application (e.g., 
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expression of protein, antisense inhibition of gene expression, ribozyme 
production, etc.). 

[0037] The invention may further be used to produce microRNA molecules. 

MicroRNA molecules are molecules which are structurally similar to shRNA 
molecules but, typically, contain one or more mismatches or 
insertion/deletions in their regions of sequence complementary. At least some 
microRNA molecules are transcribed as polycistrons of about 400, which are 
then processed to RNA molecules of about 70 nucleotides. These double 
stranded 70 mers are then are processed again, presumably by the enzyme 
Dicer, to two RNA molecules which are about 22 nucleotides in length and 
often have one or more (e.g., one, two, three, four, five, etc.) internal 
mismatches in their regions of sequence complementarity. Lee et al, EMBO 
21:4663-4670 (2002). The invention also includes, for example, uses of 
microRNA molecules and nucleic acid molecules which encode microRNA 
molecules which are similar to the uses described those described herein for 
shRNA and non-hairpin doule stranded RNA molecules. 

[0038] The present invention also provides methods of regulating the 

expression of one or more genes in a cell or an animal using interfering RNA, 
comprising: (a) identifying one or more target nucleic acid sequences; (b) 
preparing one or more nucleic acid molecules which encode one or more 
interfering RNAs, wherein the interfering RNAs bind to the one or more target 
nucleic acid sequences; (c) combining in vitro or in vivo, (i) the one or more 
first nucleic acid molecules encoding one or more interfering RNAs that have 
one or more sticky ends that have been generated by one or more restriction 
enzymes {e.g. type lis restriction enzymes); and (ii) one or more second 
nucleic acid molecules comprising one or more ends which are compatible 
with the one or more sticky ends on the first nucleic acid molecule(s), and 
optionally comprising one or more selectable markers; (d) incubating the 
combination under conditions sufficient to join one or more of the nucleic acid 
molecules encoding the interfering RNAs and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
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acid molecules; and (e) inserting the one or more interfering RNA expression 
vectors into the cell or one or more cells of the animal, under conditions such 
that the one or more interfering RNAs bind to the one or more target nucleic 
acid sequences, thereby regulating expression of the one or more targeted 
genes. 

[0039] In related embodiments, the present invention also provides methods of 

regulating the expression of one or more genes in a cell or an animal using 
interfering RNA, comprising: (a) identifying one or more target nucleic acid 
sequences; (b) preparing one or more nucleic acid molecules which comprise 
one or more interfering RNAs, wherein the interfering RNAs bind to the one 
or more target nucleic acid sequences; (c) combining in vitro or in vivo, (i) the 
one or more first nucleic acid molecules comprising one or more interfering 
RNAs flanked by one or more first restriction sites (e.g. one or more type lis 
restriction enzyme recognition sites); (ii) one or more second, nucleic acid 
molecules comprising one or more second restriction sites (e.g. one or more 
type lis restriction enzyme recognition sites) and optionally comprising one or 
more selectable markers; and (iii) one or more site-specific restriction 
enzymes (e.g. one or more type lis restriction enzymes); (d) incubating the 
combination under conditions sufficient to join one or more interfering RNAs 
and one or more of the second nucleic acid molecules, thereby producing one 
or more desired product nucleic acid molecules; and (e) inserting the one or 
more interfering RNA expression vectors into the cell or one or more cells of 
the animal, under conditions such that the one or more interfering RNAs bind 
to the one or more target nucleic acid sequences, thereby regulating expression 
of the one or more targeted genes. 
[0040] Such methods of the invention can be used to knockout or knockdown 

one or more genes in vivo in a cell or animal. These methods of the invention 
may also be used to produce genetically modified animals by expressing 
interfering RNA in germ cells or somatic cells, and for preparation of 
transgenic animals. 
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[0041] In another embodiment, the present invention also provides isolated 

nucleic acid molecules comprising: (a) one or more sticky ends that have been 
generated by one or more restriction enzymes (e.g. one or more type lis 
restriction enzymes); and (b) optionally one or more selectable markers. The 
present invention also provides isolated nucleic acid molecules comprising: (a) 
one or more restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites); and (b) optionally one or more selectable markers. 

[0042] Suitable restriction enzyme recognition sites and selectable markers are 

described above. The isolated nucleic acid molecules of the present invention 
may also comprise one or more recombination sites and/or one or more 
topoisomerase recognition sites and/or one or more topoisomerases. If 
present, the topoisomerase recognition sites may be flanked by recombination 
sites. The isolated nucleic acid molecules of the present invention may be 
vectors or linear nucleic acid molecules. The present invention also provides 
isolated nucleic acid molecules comprising: (a) one or more sticky ends that 
have been generated by one or more restriction enzymes (e.g. one or more 
type lis restriction enzymes); and (b) one or more recombination sites. The 
present invention further provides isolated nucleic acid molecules comprising: 
(a) one or more restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites); and (b) one or more recombination sites. 

[0043] The present invention also provides vectors comprising: (a) one or 

more desired nucleic acid segments; (b) optionally one or more toxic genes; 
and (c) one or more sites that are compatible with a sticky end generated by a 
restriction enzyme (e.g. one or more type lis restriction enzymes). Suitable 
desired nucleic acid molecules include genes (e.g. open reading frames) and 
promoters. The vectors of the present invention may also comprise one or 
more recombination sites, and one or more topoisomerase recognition sites 
and/or one or more topoisomerases, wherein, the topoisomerase recognition 
sites if present, may be flanked by recombination sites. In other embodiments, 
the vectors of the present invention may optionally comprise one or more 
selectable markers as described above. Suitable vectors of the present 
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invention include, but are not limited to, pENTR/U6-co/B (vector diagram 
shown in Figure 2A, vector sequence in Table 5 and SEQ ID NO:l). 
[0044] The present invention also provides vectors comprising: (a) one or 

more desired nucleic acid segments; (b) optionally one or more toxic genes; 
and (c) one or more restriction sites {e.g. one or more type lis restriction 
enzyme recognition sites). Suitable desired nucleic acid molecules include 
genes and promoters. The vectors of the present invention may also comprise 
one or more recombination sites, and one or more topoisomerase recognition 
sites and/or one or more topoisomerases, wherein, the topoisomerase 
recognition sites if present, may be flanked by recombination sites. In other 
embodiments, the vectors of the present invention may optionally comprise 
one or more selectable markers as described above. Suitable vectors of the 
present invention include, but are not limited to, pENTR/U6-ccrfB (vector 
diagram shown in Figure 2A, vector sequence in Table 5, Figure 12 and SEQ 
IDNO:l). 

[0045] The present inventipn also provides host cells comprising one or more 

of the isolated nucleic acid molecules or nucleic acid segments of the present 
invention, and methods of expressing the isolated nucleic acids of the present 
invention in one more host cells and isolating the expressed nucleic acids. The 
present invention also provides methods of expressing and isolating proteins 
from host cells comprising one or more isolated nucleic acids or nucleic acid 
segments of the invention. 

[0046] Another embodiment of the invention provides methods of expressing 

desired product nucleic acid segments by introducing the nucleic acid 
molecules, nucleic acid segments, or vectors of the present invention into a 
host cell and expressing the product nucleic acid segments. 

[0047] The present invention also provides for compositions comprising: (a) 

one or more first nucleic acid molecules that have one or more sticky ends that 
have been generated by one or more restriction enzymes {e.g. type lis 
restriction enzymes); and (ii) one or more second nucleic acid molecules 
comprising one or more ends which are compatible with the one or more 
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sticky ends on the first nucleic acid molecule(s). The first and second nucleic 
acid molecules may optionally comprise one or more selectable markers as 
discussed above. These first and second nucleic acid molecules may also 
comprise one or more recombination sites, one or more topoisomerase 
recognition sites and/or one or topoisomerases, wherein the topoisomerase 
recognition sites, if present, may be flanked by recombination sites. The 
optional selectable markers may be flanked by type lis restriction sites and/or 
recombination sites. The compositions of the invention may also comprise 
one or more recombination proteins as described above. 

[0048] The present invention further provides for compositions comprising: 

(a) one or more first nucleic acid molecules comprising at least one nucleic 
acid segment that is flanked by one or more first restriction sites (e.g. one or 
more type Us restriction enzyme recognition sites; (b) one or more second 
nucleic acid molecules optionally comprising one or more second restriction 
sites {e.g. one or more type lis restriction enzyme recognition sites); and (c) 
one or more restriction enzymes (e.g. type lis restriction enzymes) that are 
specific for the first and/or second restriction sites. The first and second 
nucleic acid molecules and/or nucleic acid segments may optionally comprise 
one or more selectable markers as discussed above. These first and second 
nucleic acid molecules and/or nucleic acid segments may also comprise one or 
more recombination sites, one or more topoisomerase recognition sites and/or 
one or topoisomerases, wherein the topoisomerase recognition sites, if present, 
may be flanked by recombination sites. The optional selectable markers may 
be flanked by type lis restriction sites and/or recombination sites. The 
compositions of the invention may also comprise one or more recombination 
proteins as described above. 

[0049] The present invention also provides kits comprising the isolated 

nucleic acids or vectors of the present invention. The kits of the present 
invention may further comprise one or more type lis restriction enzymes, one 
or more recombination proteins, and one or more host cells. 
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[0050] Other embodiments of the present invention will be apparent to one or 

ordinary skill in light of the following drawings and description of the 
invention, and of the claims. 

Brief Description of the Drawings 

[0051] Figure 1A is a schematic diagram of a vector of the invention 

comprising: an origin of replication (ori), a kanamycin resistance gen (kan), a 
Polymerase II promoter (polD), LI (attLl) and L2 (attLl) recombination sites, 
an ATG translation initiation site/codon, a secretion signal, type lis restriction 
sites, and a negative selectable marker. 

[0052] Figure IB is a schematic diagram of a vector of the invention 

comprising: an origin of replication (ori), a kanamycin resistance gen (kan), a 
Polymerase II promoter (polIQ, LI (attLl) and L2 (attL2) recombination sites, 
an ATG initiation site/codon, an affinity tag, a cleavage site, a type lis 
restriction site, and a negative selectable marker. 

[0053] Figure 2A is a schematic diagram of pENTR/U6. 

[0054] Figure 2B depicts a Bsal digestion and cloning scheme using 

pENTR/U6. 

[0055] Figures 3A and 3B depict luciferase and p-gal suppression in 

GripTite™ 293 cells by transient cotransfection of reporters and pENTRAJ6 
vectors. A) Luciferase activities measured in lysates of cells: from left 1) 
untransfected, 2) cotransfected with luciferase and lacZ reporter genes plus a 
dummy plasmid (pUC19/actin), or 3-4) same as 2 except either pENTR/U6 
targeting luciferase (GL2-22) or p-gal (lacZ-19) replace the pUC19/actin. B) 
P-gal activity measurements of the same lysates as in A. Activities are the 
average of duplicate wells. The standard error of the mean is indicated for 
each sample. 

[0056] Figure 4A and 4B depicts RNAi of p-Gal and Luciferase activity from 

co'-transfected reporter constructs by pENTR/U6 shRNA clones. Data are 
reported as the ratio of lacZ and Luciferase activity. Error bars are calculated 
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from two independent samples. AS/SA indicates the orientation of the sense 
and anti-sense strand relative to the U6 promoter. A) Luciferase/p-gal activity 
after co-transfection with the indicated pENTR/U6 shRNA sequences 
targeting the Luciferase gene and a pUC19-actin control. pENTR/U6-A6- 
GL2-22(AS) is the same construct used in Figure 3. The asterisk (*) after 
ENTR/U6-A6-GL2-2-SA indicates a point mutation was identified in the 
shRNA target sequence clone used in this experiment. B) p-gal/Luciferase 
activity after co-transfection with various pENTR/U6 shRNA sequences 
targeting the LacZ gene. ENTR/U6- A6-lacZ- 1 9 is the same construct used to 
generate the data presented in Figure 3. 

[0057] Figure 5 depicts p-gal/Luciferase activity ratios after co-transfection 

reporter plasmids and pENTR/U6 LacZ-19 shRNA target clones with the 
indicated Terminator lengths. Terminators with 4, 5, 6 and 8 "Ts" were tested 
in the pENTR/U6.2 vector (A4-8). 

[0058] Figure 6A is a schematic of the lentiviral RNAi shRNA transfer 

vector: pLenti6/RNAi-DEST which is a promoterless Gateway-adapted lenti 
vector which may be used to clone, for example an shRNA cassette of interest 
via Gateway LxR reaction with pENTR U6 vectors. The shRNA cassette will 
often contain an RNA pol HI - or other- promoter of choice to drive hairpin 
expression. The vector confers blasticidin resistance to transduced cells. 

[0059] Figure 6B is a schematic of the lentiviral RNAi Kit control vector: Kit 

control plasmid pLenti6/RNAi/U6-GWAamAC which results from LxR 
reaction between pLenti6/RNAi-DEST and pENTR/U6-lamAC-AS-cgaa. 
pLenti6/RNAi/U6-GW/lamAC expresses lamAC-AS-cgaa hairpin to 
specifically knockdown lamin A/C expression. 

[0060] Figure 7 depicts the inhibition of lamin A/C expression. Lenti6/RNAi 

viruses encoding anti-lamin A/C shRNAs (U6-lamAC) were transduced into 
HeLa cells to test inhibition of lamin A/C expression. Control viruses 
encoded GFP gene (GFP) or anti-luciferase shRNAs (U6-GL2). Western blots 
for lamin A/C or beta-actin were conducted on lysates from transduced cells. 
Top panel: Lysates were prepared 48 hrs post-transduction. Bottom panel: 
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Lysates were prepared from transduced* shRNA-producing, blasticidin- 
resistant cells 5 days post-transduction. 



[0061] 


Figure 8A is a plasmid map of pLenti6/V5-DEST. 


[0062] 


Figure 8B is a plasmid map of pLenti6/V5-gTOPO®. 


[0063] 


Figure 8C is a plasmid map of pLenti4/V5-DEST 


[0064] 


Figure 8D is a plasmid map of pLenti6/UbC/V5-DEST. 


[0065] 


Figure 9A is a plasmid map pLP 1 . 


[0066] 


Figure 9B is a plasmid pLP2. 


[0067] 


Figure 9C is a plasmid map of pLP/VSVG. 


[0068] 


Figure 10 is a plasmid map of pAd/PL-DEST. 


[0069] 


Figure 11 is a plasmid map of pAd/CMV/V5-DEST. 


[0070] 


Figure 12 depicts the nucleic acid sequence of the pENTR/U6 with 



annotations noting the various segments of the vector. SEQ ID NO: 1 



[0071] 


Figure 13 depicts KNAi overview. 


[0072] 


Figure 14 depicts RNAi Mechanistic Model. 


[0073] 


Figure 15 depicts RNAi Methods. 


[0074] 


Figure 16 depicts siRNA Molecules. 


[0075] 


Figure 17 depicts Transfection of siRNAs 


[0076] 


Figure 18 depicts Variation in siRNA effectiveness. 


[0077] 


Figure 19 depicts expression in vivo. 


[0078] 


Figure 20 depicts BLOCK-iT™ Long RNAi Transcription Kit. 


[0079] 


Figure 21 depicts BLOCK-iT™ Dicer RNAi Kit 


[0080] 


Figure 22 depicts d-siRNA knockdown. 


[0081] 


Figure 23 depicts d-siRNA vs. siRNA. 


[0082] 


Figure 24 depicts BLOCK-iT™ RNAi. 


[0083] 


Figure 25 depicts Micro RNA (miRNA). 


[0084] 


Figure 26 depicts RNAi Vectors. 


[0085] 


Figure 27 depicts U6 RNAi. 


[0086] 


Figure 28 depicts Gateway™ Cloning and ViraPower™ RNAI 



cassettes. 

[0087] Figure 29 depicts Selecting a viral expression system. 
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[0088] Figure 30 depicts Outline for lentiviral production. 

[0089] Figure 31 depicts Overview of Lentiviral Production. 

[0090] Figure 32 depicts ViraPower™ lentiviral production. 

[0091] Figure 33 depicts Clone your gene of interest into Lentivirus. 

[0092] Figure 34 depicts Two methods for fast cloning. 

[0093] Figure 35 depicts Two methods for fast cloning. 

[0094] Figure 36 depicts Subcloning an Entry Clone into Multiple 

Destination Vectors. 
[0095] Figure 37 depicts pLenti6/V5 Expression Vectors. 

[0096] Figure 38 depicts GATEWAY Cloning Technology. 

[0097] Figure 39 depicts Assembly of Three DNA segments using Existing 

Entry Clones. 

Detailed Description of the Invention 

Definitions 

i 

[0098] Unless defined otherwise, all technical and scientific terms used herein 

have the same meanings as commonly understood by one of ordinary skill in 
the art to which this invention belongs. 

[0099] One or more: As used herein, the term "one or more" includes at least 

one, more suitably, one, two, three, four, five, ten, twenty, fifty, one-hundred, 
five-hundred, etc., of the item to which "one or more" refers. 

[0100] Nucleic Acid: As used herein, "nucleic acid" refers to polynucleotides 

such as deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The term 
should also be understood to include, as applicable to the embodiment being 
described, single-stranded (such as sense or antisense) and double stranded 
polynucleotides, including double-stranded DNA-RNA hybrids. The term 
"nucleic acid" also is synonymous, and may be used interchangeably with the 
term "nucleic acid molecule." 
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[0101] Gene: As used herein, "gene" refers to a nucleic acid comprising an 

open reading frame encoding a polypeptide, including both exon and 
(optionally) intron sequences. 

[0102] About: As used herein, when referring to any numerical value, "about" 

means a value of ±10% of the stated value (e.g. "about 50°C encompasses a 
range of temperatures from 45°C to 55°C, inclusive: similarly, "about 100 
mM" encompasses a range of concentrations from 90 mM to 110 mM, 
inclusive). 

[0103] Host: As used herein, a "host" is any prokaryotic or eukaryotic 

organism that is a recipient of a replicable expression vector, cloning vector or 
any nucleic acid molecule. The nucleic acid molecule may contain, but is not 
limited to, a structural gene, a transcriptional regulatory sequence (such as a 
promoter, enhancer, repressor, and the like) and/or an origin of replication. As 
used herein, the terms "host," "host cell," "recombinant host" and 
"recombinant host cell" may be used equivalently and interchangeably. For 
examples of such hosts, see Maniatis et al., Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 
(1982). 

[0104] Derivative: As used herein the term "derivative," when used in 

reference to a vector, means that the derivative vector contains one or more 
(e.g., one, two, three, four five, etc.) nucleic acid segments which share 
sequence similar to the vectors represented in Figure 1A, Figure IB, 
Figure 2A, Figure 6A, Figure 6B, Figure 8A, Figure 8B, Figure 8 C, 
Figure 8D, Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, 
Table 5, and any other vector encompassed by the present application. In 
particular embodiments, a derivative vector (1) may be obtained by alteration 
of a vector represented in Figure 1A, Figure IB, Figure 2A, Figure 6A, 
Figure 6B, Figure 8A, Figure 8B, Figure 8C, Figure 8D, Figure 9A, Figure 9B, 
Figure 9C, Figure 10, Figure 11, Figure 12, Table 5, and any other vector 
encompassed by the present application, or (2) may contain one or more 
elements (e.g., antibiotic resistance marker, recombination or restriction site, 
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etc.) of a vector represented in Figure 1 A, Figure IB, Figure 2A, Figure 6A, 
Figure 6B, Figure 8A Figure 8B, Figure 8C, Figure 8D, Figure 9 A, Figure 9B, 
Figure 9C, Figure 10, Figure 11, Figure 12, Table 5, and any other vector 
encompassed by the present application. Further, as noted above, a derivative 
vector may contain one or more element which shares sequence similarity 
(e.g., at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at 
least 95%, etc. sequence identity at the nucleotide level) to one or more 
element of a vector represented in Figure 1A Figure IB, Figure 2A, 
Figure 6A Figure 6B, Figure 8A, Figure 8B, Figure 8C, Figure 8D, 
Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, Table 5, 
and any other vector encompassed by the present application. Derivative 
vectors may also share at least at least 50%, at least 60%, at least 70%, at least 
80%, at least 90%, at least 95%, etc. sequence identity at the nucleotide level 
to the complete nucleotide sequence of a vector represented in Figure 1A, 
Figure IB, Figure 2A, Figure 6A, Figure 6B, Figure 8 A Figure 8B, Figure 8C, 
Figure 8D, Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, 
Table 5, and any other vector encompassed by the present application. 
Derivative vectors include those which have been generated by performing a 
cloning reaction upon a vector represented in Figure 1A, Figure IB, 
Figure 2A, Figure 6 A, Figure 6B, Figure 8 A Figure 8B, Figure 8C, 
Figure 8D, Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, 
Table 5, and any other vector encompassed by the present application. 
Derivative vectors also include vectors which have been generated by the 
insertion into another vector of one or more structural and/or functional 
components of a vector (e.g. one or more genes or portions thereof encoding 
one or more structural or functional proteins (or portions thereof) of a vector), 
including but not limited to the vectors represented in Figure 1 A Figure IB, 
Figure 2A, Figure 6A, Figure 6B, Figure 8A Figure 8B, Figure 8C, 
Figure 8D, Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, 
Table 5, and any other vector encompassed by or suitable for use in the 
invention. Often these derivative vectors will contain at least 50%, at least 
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60%, at least 70%, at least 80%, at least 90%, at least 95%, etc. of the nucleic 
acid present in a vector represented in Figure 1 A, Figure IB, Figure 2A, 
Figure 6A, Figure 6B, Figure 8A, Figure 8B, Figure 8C, Figure 8D, 
Figure 9A, Figure 9B, Figure 9C, Figure 10, Figure 11, Figure 12, Table 5, 
and any other vector encompassed by the present application. Derivative 
vectors also include progeny of any of the vectors referred to above, as well as 
vectors referred to above which have been subjected to mutagenesis (e.g., 
random mutagenesis). 
[0105] Promoter: As used herein, a promoter is an example of a transcriptional 

regulatory sequence, and is specifically a nucleic acid sequence generally 
described as the proximal region of a gene located 5' to the start codon. The 
transcription of an adjacent nucleic acid segment is initiated at the promoter 
region. A repressible promoter's rate of transcription decreases in response to 
a repressing agent. An inducible promoter's rate of transcription increases in 
response to an inducing agent. A constitutive promoter's rate of transcription 
is not specifically regulated, though it can vary under the influence of general 
metabolic conditions. Suitable examples of promoters that may be used in the 
present invention include, but are not limited to polymerase HI promoters such 
as HI and U6. 

[01061 Product: As used herein, a "product" is one of the desired daughter 

molecules produced after cloning process. The product contains the nucleic 
acid which was to be cloned or subcloned. 

[0107] Recognition sequence: As used herein, a "recognition sequence" 

(alternatively and equivalently referred to herein as a recognition site) is a 
particular sequence to which a protein, chemical compound, DNA, or RNA 
molecule (e.g., restriction endonuclease, a topoisomerase, a modification 
methylase, a type lis restriction enzyme, or a recombinase) recognizes and 
binds. In the present invention, a recognition sequence may refer to a 
recombination site (which may alternatively be referred to as a recombinase 
recognition site), a topoisomerase recognition site, or a type lis restriction 
enzyme recognition site. For example, the recognition sequence for Cre 
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recombinase is loxP which is a 34 base pair sequence comprised of two 13 
base pair inverted repeats (serving as the recombinase binding sites) flanking 
an 8 base pair core sequence. See Figure 1 of Sauer, B., Current Opinion in 
Biotechnology 5:521-527 (1994). Other examples of such recognition 
sequences are the attB, atiP, atiL, and attR sequences which are recognized by 
the recombinase enzyme. Integrase atfB is an approximately 25 base pair 
sequence containing two 9 base pair core-type Int binding sites and a 7 base 
pair overlap region. attP is an approximately 240 base pair sequence 
containing core-type Int binding sites and arm-type Int binding sites as well as 
sites for auxiliary proteins integration host factor (IHF), FIS and excisionase 
(Xis). See Landy, Current Opinion in Biotechnology 3:699-707 (1993). Such 
sites may also be engineered according to the present invention to enhance 
production of products in the methods of the invention. When such 
engineered sites lack the PI or HI domains to make the recombination 
reactions irreversible (e.g., attR or atfP), such sites may be designated attR or 
attY to show that the domains of these sites have been modified in some way. 
Examples of topoisomerase recognitions sites include, but are not limited to, 
the sequence S'-GCAACTT-S' that is recognized by E. coli topoisomerase m 
(a type I topoisomerase); the sequence 5'-(C/T)CCTT-3' which is a 
topoisomerase recognition site that is bound specifically by most poxvirus 
topoisomerases, including vaccinia virus DNA topoisomerase I; and others 
that are known in the art as discussed elsewhere herein. 
[0108] Recombination proteins: As used herein, "recombination proteins" 

include excisive or integrative proteins, enzymes, co-factors or associated 
proteins that are involved in recombination reactions involving one or more 
recombination sites, which may be wild-type proteins (See Landy, Current 
Opinion in Biotechnology 3:699-707 (1993)), or mutants, derivatives (e.g., 
fusion proteins containing the recombination protein sequences or fragments 
thereof), fragments, and variants thereof. Suitable recombination proteins for 
use in the present invention include, but are not limited to hit, Cre, IHF, Xis, 
Fis, Hin, Gin, Cin, Tn3 resolvase, TndX, XerC and XerD. 
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Recombination site: As used herein, a "recombination site" is a 
recognition sequence on a nucleic acid molecule participating in an 
integration/recombination reaction by recombination proteins. Recombination 
sites are discrete sections or segments of nucleic acid on the participating 
nucleic acid molecules that are recognized and bound by a site-specific 
recombination protein during the initial stages of integration or recombination. * 
For example, the recombination site for Cre recombinase is loxP which is a 34 
base pair sequence comprised of two 13 base pair inverted repeats (serving as 
the recombinase binding sites) flanking an 8 base pair core sequence. See 
Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521-527 (1994). Other examples 
of recognition sequences include the atiB, atiP, attL, and attR sequences 
described herein, and mutants, fragments, variants and derivatives thereof, 
which are recognized by the recombination protein Int and by the auxiliary 
proteins integration host factor (IHF), FIS and excisionase (Xis). See Landy, 
Curr. Opin. Biotech 3:699-707 (1993). 

Recombinational Cloning: As used herein, "recombinational cloning" 
is a method, such as that described in U.S. Patent Nos. 5,888,732, 6,143,557, 
6,171,861, 6,270,969, and 6,277,608 (the contents of which are fully 
incorporated herein by reference), whereby segments of nucleic acid 
molecules or populations of such molecules are exchanged, inserted, replaced, 
substituted or modified, in vitro or in vivo. Suitably, such cloning method is 
an in vitro method, i.e., a method in which the recombination reaction takes 
place outside of or in the absence of host cells. 

| Selectable marker: As used herein, "selectable marker" is a nucleic 

acid segment that allows one to select for or against a molecule (e.g., a 
replicon) or a cell that contains it, often under particular conditions. These 
markers can encode an activity, such as, but not limited to, production of 
RNA, peptide, or protein, or can provide a binding site for RNA, peptides, 
proteins, inorganic and organic compounds or compositions and the like. 
Examples of selectable markers include but are not limited to: (1) nucleic acid 
segments that encode products which provide resistance against otherwise 
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toxic compounds (e.g., antibiotics); (2) nucleic acid segments that encode 
products which are otherwise lacking in the recipient cell (e.g., tRNA genes, 
auxotrophic markers); (3) nucleic acid segments that encode products which 
suppress the activity of a gene product; (4) nucleic acid segments that encode 
products which can be readily identified (e.g., phenotypic markers such as p- 
galactosidase, green fluorescent protein (GFP), yellow fluorescent protein 
(YFP), cyan fluorescent protein (CFP), and cell surface proteins); (5) nucleic 
acid segments that bind products which are otherwise detrimental to cell 
survival and/or function; (6) nucleic acid segments that otherwise inhibit the 
activity of any of the nucleic acid segments described in Nos. 1-5 above (e.g., 
antisense oligonucleotides); (7) nucleic acid segments that bind products that 
modify a substrate (e.g. restriction endonucleases); (8) nucleic acid segments 
that can be used to isolate or identify a desired molecule (e.g. specific protein 
binding sites); (9) nucleic acid segments that encode a specific nucleotide 
sequence which can be otherwise non-functional (e.g., for PCR amplification 
of subpopulations of molecules); (10) nucleic acid segments, which when 
absent, directly or indirectly confer resistance or sensitivity to particular 
compounds; and/or (11) nucleic acid segments that encode products which are 
toxic in recipient cells. 
[0112] Examples of toxic gene products are well known in the art, and 

include, but are not limited to, restriction endonucleases {e.g., Dpnl), 
apoptosis-related genes {e.g. ASK1 or members of the bcl-2/ced-9 family), 
retroviral genes including those of the human immunodeficiency virus (HIV), 
defensins such as NP-1, inverted repeats or paired palindromic nucleic acid 
sequences, bacteriophage lytic genes such as those from (OX174 or 
bacteriophage T4; antibiotic sensitivity genes such as rpsL, antimicrobial 
sensitivity genes such as pheS, plasmid killer genes, eukaryotic transcriptional 
vector genes that produce a gene product toxic to bacteria, such as GATA-1, 
and genes that kill hosts in the absence of a suppressing function, e.g., kicB, 
ccdB, OX174 E (Liu, Q. et al, Curr. Biol 5:1300-1309 (1998), and other 



-32- 



genes that negatively affect replicon stability and/or replication. A toxic gene 
can alternatively be selectable in vitro, e.g., a restriction site. 
[0113] Selection scheme: As used herein, "selection scheme" is any method 

which allows selection, enrichment, or identification of a desired product or 
product(s). The selection schemes of one suitable embodiment have at least 
two components that are either linked or unlinked during recombinational 
cloning. One component is a Selectable marker. The other component 
controls the expression in vitro or in vivo of the Selectable marker, or survival 
of the cell (or the nucleic acid molecule, e.g., a replicon) harboring the 
plasmid carrying the Selectable marker. Generally, this controlling element 
will be a repressor or inducer of the Selectable marker, but other means for 
controlling expression or activity of the Selectable marker can be used. 
Whether a repressor or activator is used will depend on whether the marker is 
for a positive or negative selection, and the exact arrangement of the various 
nucleic acid segments, as will be readily apparent to those skilled in the art. 

[0114] Fragments of selectable markers can be arranged relative to the 

recombination sites or restriction sites such that when the segments are 
brought together, they reconstitute a functional Selectable marker. For 
example, the linking event can link a promoter with a structural nucleic acid 
molecule (e.g., a gene), can link two fragments of a structural nucleic acid 
molecule, or can link nucleic acid molecules that encode a heterodimeric gene 
product needed for survival, or can link portions of a replicon. 

[0115] Site-specific recombinase: As used herein, a "site specific 

recombinase" is a type of recombinase which typically has at least the 
following four activities (or combinations thereof): (1) recognition of one or 
two specific nucleic acid sequences; (2) cleavage of said sequence or 
sequences; (3) topoisomerase activity involved in strand exchange; and (4) 
ligase activity to reseal the cleaved strands of nucleic acid. See Sauer, B., 
Current Opinions in Biotechnology J:521-527 (1994). Conservative site- 
specific recombination is distinguished from homologous recombination and 
transposition by a high degree of specificity for both partners. The strand 
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exchange mechanism involves the cleavage and rejoining of specific nucleic 
acid sequences in the absence of DNA synthesis (Landy, A. (1989) Ann. Rev. 
Biochem. 55:913-949). 

Vector: As used herein, a "vector" is a nucleic acid molecule 
(preferably DNA) that provides a useful biological or biochemical property to 
an Insert. Examples include plasmids, phages, autonomously replicating 
sequences (ARS), centromeres, and other sequences which are able to 
replicate or be replicated in vitro or in a host cell, or to convey a desired 
nucleic acid segment to a desired location within a host cell. A Vector can 
have one or more restriction endonuclease recognition sites (whether type I, II 
or lis) at which the sequences can be cut in a determinable fashion without 
loss of an essential biological function of the vector, and into which a nucleic 
acid fragment can be spliced in order to bring about its replication and cloning. 
Vectors can also comprise one or more recombination sites that permit 
exchange of nucleic acid sequences between two nucleic acid molecules. 
Such as, for example, subcloning of genes of interest between Entry and 
Destination vectors in the Gateway™ system (available from Invitrogen 
Corporation, Carlsbad, CA (see, e.g., Figure 36)). Vectors can further provide 
primer sites, e.g., for PCR, transcriptional and/or translational initiation and/or 
regulation sites, recombinational signals, replicons, Selectable markers, etc. 
Clearly, methods of inserting a desired nucleic acid fragment which do not 
require the use of recombination, transpositions or restriction enzymes (such 
as, but not limited to, UDG cloning of PCR fragments (U.S. Patent No. 
5,334,575, entirely incorporated herein by reference), TA Cloning® brand 
PCR cloning (Invitrogen Corporation, Carlsbad, CA) (also known as direct 
ligation cloning), and the like) can also be applied to clone a fragment into a 
cloning vector to be used according to the present invention. The cloning 
vector can further contain one or more selectable markers suitable for use in 
the identification of cells transformed with the cloning vector. 
] Incorporating: As used herein, "incorporating" means becoming a part 

of a nucleic acid (e.g., DNA) molecule or primer. 
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[0118] Nucleotide: As used herein, a "nucleotide" is a base-sugax-phosphate 

combination. Nucleotides are monomelic units of a nucleic acid molecule 
(DNA and RNA). The term nucleotide includes ribonucleoside triphosphates 
ATP, UTP, CTG, GTP and deoxyribonucleoside triphosphates such as dATP, 
dCTP, dTTP, dUTP, dGTP, dTTP, or derivatives thereof. The term nucleotide 
as used herein also refers to dideoxyribonucleoside triphosphates (ddNTPs) 
and their derivatives. Illustrated examples of dideoxyribonucleoside 
triphosphates include, but are not limited to, ddATP, ddCTP, ddGTP, ddlTP, 
and ddTTP. According to the present invention, a "nucleotide" may be 
unlabeled or detectably labeled by well known techniques. Detectable labels 
include, for example, radioactive isotopes, fluorescent labels, 
chemiluminescent labels, bioluminescent labels and enzyme labels. 

[0119] Portion: As used herein, the term "portion" refers to part, or percentage 

of a whole entity. For example, a "portion" of a nucleic acid molecule refers 
to 1%, 10%, 25%, 50%, 75%, 90%, 99%, etc., of the whole nucleic acid 
molecule. 

[0120] Segment: As used herein, the term "segment" refers to part, or 

percentage of a whole entity. For example, a "segment" of a nucleic acid 
molecule refers to 1%, 10%, 25%, 50%, 75%, 90%, 99%, etc., of the whole 
nucleic acid molecule. 

[0121] Other terms used in the fields of recombinant nucleic acid technology 

and molecular and cell biology as used herein will be generally understood by 
one of ordinary skill in the applicable arts. 

[0122] The present invention relates to methods, compositions, isolated 

nucleic acids, vectors and kits for seamless cloning of nucleic acid molecules 
and production of nucleic acids and proteins. 

[0123] The vectors represented througout, specifically shown in Figures 1A, 

IB, 2A, 6A and 6B, 8A, 8B, 8C, 8D, 9A, 9B, 9C, 10, 1 1, 28, 33, 37 as well as 
similar vectors and portions of these vectors, may be used in the practice of 
the methods of the present invention. In each case, these vectors are designed 
such that upon digestion with a restriction enzyme (e.g. a type lis restriction 
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enzyme), a sticky end is generated abutting and/or including nucleic acids 
which encode a peptide which may be cleaved from a protein or peptide 
encoded by a nucleic acid which is inserted into the vector. These, and other 
vectors of the present invention may further comprise one or more signal 
peptides and/or protease cleavage sites. The vectors of the present invention 
allow for the production of a protein that is exported from a cell and cleaved to 
generate a "mature" protein. The vectors of the present invention also allow 
for the production of a protein that is retained in the cell as a "native" protein. 
[0124] In one aspect, the present invention provides methods for joining one 

or more (e.g. one, two, three, four, five, etc.) first nucleic acid molecules and a 
second one or more nucleic acid molecules, comprising: (a) combining the 
first and second nucleic acid molecules under conditions sufficient to allow for 
the joining of at least one terminus of the first nucleic acid molecule(s) to at 
least one terminus of the second nucleic acid molecule(s), wherein the 
terminus of the first nucleic acid molecule which is connected to the terminus 
of the second nucleic acid molecule(s) comprises a sticky end (e.g. an 
overhanging end) generated by a restriction enzyme (e.g. a type lis restriction 
enzyme) and the terminus of the second nucleic acid molecule(s) is compatible 
(e.g. a blunt end or a sticky end) with this sticky end. In embodiments similar 
to the above and elsewhere herein, the sticky end my be on the terminus of the 
second nucleic acid molecule and the first nucleic acid molecule may contain a 
compatible end. 

[0125] As in other embodiments of the invention described herein, the second 

nucleic acid molecule may contain an end which is generated by digestion 
with a type lis restriction enzyme and the first nucleic acid molecule may 
contain a compatible end generated by other means. 

[0126] In suitable embodiments, the present invention provides methods of 

cloning or subcloning one or more desired nucleic acid molecules comprising: 
(a) combining in vitro or in vivo, (i) one or more first nucleic acid molecules 
comprising one or more sticky ends that have been generated by one or more 
restriction enzymes (e.g. one or more type Its restriction enzymes); and (ii) 



-36- 



one or more second nucleic acid molecules comprising one or more ends 
which are compatible with the one or more sticky ends on the first nucleic acid 
molecule(s) and, optionally, one or more selectable markers; and (b) 
incubating the combination under conditions sufficient to join the first nucleic 
acid molecule and one or more of the second nucleic acid molecules, thereby 
producing one or more desired product nucleic acid molecules. 

In another aspect, the present invention provides methods for cloning 
or subcloning one or more desired nucleic acid molecules comprising: (a) 
combining in vitro or in vivo, (i) one or more first nucleic acid molecules 
comprising one or more sticky ends that have been generated by one or more 
restriction enzymes (e.g. one or more type lis restriction enzymes); (ii) one or 
more second nucleic acid molecules comprising one or more restriction sites 
(e.g. one or more first type lis restriction enzyme recognition sites) and, 
optionally, one or more selectable markers; and (iii) one or more restriction 
enzymes (e.g., one or more type lis restriction enzymes) that are specific for 
the restriction enzyme recognition site; and (b) incubating the combination 
under conditions sufficient to join the first nucleic acid molecule and one or 
more of the second nucleic acid molecules, thereby producing one or more 
desired product nucleic acid molecules. 

In another aspect, the present invention provides methods for cloning 
, or subcloning one or more desired nucleic acid molecules, or portions thereof, 
comprising: (a) combining in vitro or in vivo, (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment that is flanked by one 
or more restriction sites (e.g. one or more first type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more ends which are compatible with a sticky end on the segment and, 
optionally, one or more selectable markers; and (iii) one or more restriction 
enzymes (e.g., one or more type lis restriction enzymes) that are specific for 
the restriction enzyme recognition site; and (b) incubating the combination 
under conditions sufficient to join the first nucleic acid segment and one or 
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more of the second nucleic acid molecules, thereby producing one or more 
desired product nucleic acid molecules. 
[0129] In another aspect, the present invention provides methods for cloning 

or subcloning one or more desired nucleic acid molecules, or portions thereof, 
comprising: (a) combining in vitro or in vivo, (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment that is flanked by one 
or more first restriction sites (e.g. one or more first type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more second restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites) and, optionally, one or more selectable markers; and 
(iii) one or more restriction enzymes (e.g. one or more type lis restriction 
enzymes) that are specific for the first and/or second type As restriction 
enzyme recognition sites; and (b) incubating the combination under conditions 
sufficient to join the first nucleic acid segment and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. 

[0130] The seamless cloning methods of the present invention may utilize any 

restriction enzyme, including those which cleave nucleic acid molecules to 
produce blunt ends. The term "blunt ends" as used herein is used to indicate a 
nucleic acid molecule which has been cleaved by a restriction enzyme in such 
a way as to produce a double stranded nucleic acid in which both strands stop 
"bluntly" and do not overlap or overhang the other. Suitably, the methods of 
the invention utilize type lis restriction sites. The present invention also 
encompasses the use of blunt-end cleavage enzymes, such as, but not limited 
to, Seal, Smal, Hpal, Hindi, HaeSL and AM. 

[0131] Type-Es restriction enzymes and recognition sites which are useful in 

all aspects of the present invention include, but are not limited to, Earl, MnR, 
Plel, AlwX Bbsl, Bsal, BsmM, BspMI, Esp3I, Hgal, Sap\ 5/aNI, Bbvl, BsrriFl, 
Fokl, BseRI, HphI, Alw26l, BbvU, Bpml, Bsml, Bbsl, BsmBl, Bael, Bsrl, Mfyl, 
BsrDl, EcoSll, Gsul, Mn\l, Plel, Taqll, Tthl 1 in and MboTL In all aspects of 
the present invention, the restriction enzyme recognition sites on the first and 
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second nucleic acid molecules may be the same sites or they may be different. 
In addition, the restriction enzyme recognition sites may be the same or 
different on each nucleic acid molecule. This allows for selective cloning 
where only nucleic acid segments with complementary sites will transfer 
between nucleic acids molecules. 

[0132] Cleavage of a polynucleotide sequence with a type Hs restriction 

enzyme leaves an overhang on one strand of the sequence, or a sticky end. 
Via the cloning methods of the present invention, this sticky end can be 
combined with a compatible sequence on a second nucleic acid molecule 
resulting in a cloned, co-joined molecule. Sequences cleaved by Type lis sites 
may also be joined to blunt ended compatible nucleic acid sequences via the 
cloning methods of the present invention. The compatible sequences can be 
joined via various catalyzing enzymes, for example DNA ligase and 
topoisomerase. Certain type lis enzymes (e.g. MlyT) cleave and leave a blunt 
end on a nucleic acid molecule that may then be combined with a sticky end 
on a second nucleic acid molecule. 

[0133] Nucleic acid molecules of the invention to be cloned may contain a 

blunt end to be linked, and the second nucleic acid molecule involved in the 
cloning method may contain an overhang at the end which is to be linked by a 
site-specific topoisomerase (e.g., a type IA or a type IB topoisomerase), 
wherein the overhang includes a sequence complementary to that comprising 
the blunt end, thereby facilitating strand invasion as . a means to properly 
position the ends for the linking reaction. 

[0134] The nucleic acid molecules generated using this aspect of the invention 

include those in which one strand (not both strands) is covalently linked at the 
ends to be linked (i.e. double-stranded nucleic acid molecules generated using 
these methods contain a nick at each position where two ends were joined). 
These embodiments are particularly advantageous in that a polymerase can be 
used to replicate the double-stranded (ds) nucleic acid molecule by initially 
replicating the covalently linked strand. For example, a thermostable 
polymerase such as a polymerase useful for performing an amplification 
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reaction such as PCR can be used to replicate the covalently strand, whereas 
the strand containing the nick does not provide a suitable template for 
replication. 

[0135] Preferably, the 5 f termini of the ends of the nucleotide sequences to be 

linked by a type IB topoisomerase according to a method of certain aspects of 
the invention contain complementary 5' overhanging sequences, which can 
facilitate the initial association of the nucleotide sequences, including, if 
desired, in a predetermined directional orientation. Alternatively, the 5 1 
termini of the ends of the nucleotide sequences to be linked by a type IB 
topoisomerase according to a method of certain aspects of the invention 
contain complementary 5' sequences wherein one of the sequences contains a 
5' overhanging sequence and the other nucleotide sequence contains a 
complementary sequence at a blunt end of a 5 1 terminus, to facilitate the initial 
association of the nucleotide sequences through strand invasion, including, if 
desired, in a predetermined directional orientation. The term "5' overhang" or 
"5' overhanging sequence" is used herein to refer to a strand of a nucleic acid 
molecule that extends in a 5' direction beyond the terminus of the 
complementary strand of the nucleic acid molecule. Conveniently, a 5' 
overhang can be produced as a result of site specific cleavage of a nucleic acid 
molecule by a type IB topoisomerase. 

. [0136] Preferably, the 3' termini of the ends of the nucleotide sequences to be 

linked by a type IA topoisomerase according to a method of certain aspects of 
the invention contain complementary 3* overhanging sequences, which can 
facilitate the initial association of the nucleotide sequences, including, if 
desired, in a predetermined directional orientation. Alternatively, the 3 1 
termini of the ends of the nucleotide sequences to be linked by a 
topoisomerase (e.g., a type IA or a type II topoisomerase) according to a 
method of certain aspects of the invention contain complementary 3' 
sequences wherein one of the sequences contains a 3' overhanging sequence 
and the other nucleotide sequence contains a complementary sequence at a 
blunt end of a 3' terminus, to facilitate the initial association of the nucleotide 
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sequences through strand invasion, including, if desired, in a predetermined 
directional orientation. The term "3 overhang" or "3 overhanging sequence" is 
used herein to refer to a strand of a nucleic acid molecule that extends in a 5 f 
direction beyond the terminus of the complementary strand of the nucleic acid 
molecule. Conveniently, a 3 ! overhang can be produced upon cleavage by a 
type IA or type II topoisomerase. 
[0137] The cloning methods of the present invention may be performed in 

vitro or in vivo. By in vitro and in vivo herein is meant cloning that is carried 
out outside of host cells (e.g., in cell-free systems, or in systems containing 
host cells in which the various cloning and recombination reaction(s) of the 
present invention take(s) place outside of the host cells) or inside of host cells 
(e.g., using recombination or other proteins expressed by host cells), 
respectively. 

[0138] The nucleic acid molecules utilized and produced in the methods, 

compositions and kits of the present invention may be vectors or linear nucleic 
acid molecules. The term "vector," as used herein, refers to a nucleic acid 
molecule (preferably DNA) that provides a useful biological or biochemical 
property to an inserted nucleic acid. The terms "vector" and "plasmid" are 
used interchangeably herein. Examples of vectors include, phages, 
autonomously replicating sequences (ARS), centromeres, and other sequences 
which are able to replicate or be replicated in vitro or in a cell, or to convey a 
desired nucleic acid segment to a desired location within a cell of an animal. 
Vectors useful in the present invention include chromosomal-, episomal- and 
virus-derived vectors, e.g., vectors derived from bacterial plasmids or 
bacteriophages, and vectors derived from combinations thereof, such as 
cosmids and phagemids. A vector can have one or more restriction 
endonuclease recognition sites at which the sequences can be cut in a 
determinable fashion without loss of an essential biological function of the 
vector, and into which a nucleic acid fragment can be spliced in order to bring 
about its replication and cloning. Vectors can further provide primer sites, 
e.g., for PCR, transcriptional and/or translational initiation and/or regulation 
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sites, recombinational signals, replicons, selectable markers, etc. Clearly, 
methods of inserting a desired nucleic acid fragment which do not require the 
use of homologous recombination, transpositions or restriction enzymes (such 
as, but not limited to, UDG cloning of PCR fragments (U.S. Pat. No. 
5,334,575, entirely incorporated herein by reference), TA Cloning® brand 
PCR cloning (Invitrogen Corp., Carlsbad, Calif.), and the like) can also be 
applied to clone a nucleic acid into a vector to be used according to the present 
invention. The vector can optionally further contain one or more selectable 
markers suitable for use in the identification of cells transformed with the 
vector, such as the selectable markers and reporter genes described herein. 
Vectors of the present invention may be derivative vectors as described 
throughout the present specification. 

[0139] Vectors known in the art and those commercially available (and 

variants or derivatives thereof) may be used in the present invention. Such 
vectors may be obtained from, for example, Vector Laboratories Inc., 
Invitrogen, Promega, Novagen, NEB, Clontech, Boehringer Mannheim, 
Pharmacia, EpiCenter, OriGenes Technologies Inc., Stratagene, PerkinElmer, 
Pharmingen, and Research Genetics. General classes of vectors of particular 
interest include prokaryotic and/or eukaryotic cloning vectors, expression 
vectors, fusion vectors, two-hybrid or reverse two-hybrid vectors, shuttle 
vectors for use in different hosts, mutagenesis vectors, transcription vectors, 
vectors for receiving large inserts and the like. 

[0140] Other vectors of interest include viral origin vectors (Ml 3 vectors, 

bacterial phage X vectors, adenovirus vectors, and retrovirus vectors), high, 
low and adjustable copy number vectors, vectors which have compatible 
replicons for use in combination in a single host (pACYC184 and pBR322) 
and eukaryotic episomal replication vectors (pCDM8). 

[0141] Vectors for use in the present invention may comprise all, or portions 

of viral genomes, for example an adenovirus genome, a baculovirus genome, 
a herpesvirus genome, a pox virus genome, an adeno-associated virus genome, 
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a retrovirus genome, a flavivirus genome, a togavirus genome, an alphavirus 
genome, an RNA virus genome, etc. 

The present invention also encompasses the use of recombinant 
retroviruses, e.g., lentiviruses, or any other type of retrovirus may be used in 
an analogous fashion to practice the present invention. A commercially 
available system for the construction of recombinant lentiviruses is 
ViraPower™ Lentiviral Expression System, available from Invitrogen 
Corporation, Carlsbad, CA. The ViraPower™ system provides a retroviral 
system for high-level expression in dividing and non-dividing eukaryotic cells, 
e.g., mammalian cells (See Figure 29). Examples of products available from 
Invitrogen Corporation, Carlsbad, CA include the ViraPower™ Lentiviral 
Directional TOPO® Expression Kit (catalog number K4950-00), the 
ViraPower™ Lentiviral GATEWAY™ Expression Kit (catalog number 
K4960-00), and the ViraPower™ Lentiviral Support Kit (catalog number 
K4970-00). 

The present invention also encompasses replication-incompetent 
lentiviruses that can deliver and express one or more sequences of interest 
(e.g., genes). These viruses (based loosely on HIV-1) can effectively 
transduce dividing and non-dividing mammalian cells (in culture or in vivo), 
thus broadening the possible applications beyond those of traditional Moloney 
(MLV>based retroviral systems (Clontech, Stratagene, etc.). Directional 
TOPO and GATEWAY™ lentiviral vectors have been created to clone one or 
more genes of interest with a V5 epitope, if desired. The Directional TOPO 
method involves a 5 minute bench-top ligation and results in 95% correct 
orientation (See Figures 33 and 34). The GATEWAY™ method involves 
cloning and sequencing a gene of interest only once into an entry clone and 
rapidly shuttling the gene of interest from vector to vector, or the destination 
clones. The GATEWAY™ method requires no restriction digests, gel 
purification or ligase. The GATEWAY™ method is 90-100% efficient and 
accurate and the gene of interest is cloned in the right direction and in-frame 
(Figure 35). The vectors also carry the blasticidin resistance gene (bsd) to 
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allow for the selection of transduced cells. Without additional modifications, 
these vectors can theoretically accommodate up to ~6 kb of foreign gene. 
Three supercoiled packaging plasmids (gag/pol, rev and VSV-G envelope) are 
provided to supply helper functions and viral proteins in trans (See Figures 30 
and 32). Finally, an optimized producer cell line (293FT) is provided that will 
facilitate production of high titer virus. An Overview of lentiviral production 
is summarized in Figure 31 and involves the following steps: 1) Co-transfect 3 
packaging plasmids and pLenti6-GOI into 293FT; 2) VSV-G envelope 
becomes studded in cell membrane; 3) Rev transports viral genome RNA with 
gene of interest out of the nucleus; 4) gag protein packages: viral RNA mdpol 
protein; 5) Virus buds off cell, picks up envelope (pseudotyping). Plasmid 
maps of vectors adapted for use with GATEWAY™ and topoisomerase 
cloning in the production of nucleic acid molecules comprising all or a portion 
of a lentiviral genome are shown in Figures 8 A (pLenti6/V5-DEST), 8B 
(pLenti6/V5-D-TOPO®), 8C (pLenti4/V5-DEST), and 8D (pLenti6/UbC/V5- 
DEST) respectively. The nucleotide sequences of the plasmids are provided in 
Tables 6-9, SEQ ID NOS:2-5. Plasmid maps of the three packaging plasmids 
pLPl, pLP2, and pLP/VSVG are shown in Figures 9A, 9B, and 9C 
respectively and the nucleotide sequences of these plasmids are provided as 
Tables 10, 11 and 12, (SEQ ID NOS:6-8) respectively. 
1 Retroviruses are RNA viruses that reverse transcribe their genome and 

integrate the DNA copy into a chromosome of the target cell It was 
discovered that the retroviral packaging proteins (gag, pol and env) could be 
supplied in trans, thus allowing the creation of replication incompetent viral 
particles capable of stably delivering a gene of interest. These retroviral 
vectors have been available for gene delivery for many years (Miller et aL, 
(1989) BioTechniques 7:980-990). One significant advantage of retroviral- 
based delivery is that the gene of interest is stably integrated into the genome 
of the host cell with very high efficiency. In addition, no viral genes are 
expressed in these recombinant vectors making them safe to use both in vitro 
and in vivo. However, one main drawback to the traditional Moloney-based 
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retroviruses is that the target cell must undergo one round of cell division for 
nuclear import and stable integration to occur. Traditional retroviruses do not 
have an active mechanism of nuclear import and therefore must wait for the 
host cell nuclear membrane to breakdown during mitosis before they can 
access the host genomic DNA (Miller et al, Mol Cell Biol 1 0:4239-442 
(1990)). 

[0145] Unlike traditional retroviruses, HIV (classified as a lentivirus") is 

actively imported into the nuclei of non-dividing cells (Lewis et ah, J. Virol. 
68:510-516 (1994)). HIV still goes through the basic retrovirus lifecycle 
(RNA genome reverse transcribed in the target cell and integrated into the host 
genome); however, cis-acting elements facilitate active nuclear import, 
allowing HIV to stably infect non-dividing cells (for reviews see 
Buchschacher et al, Blood 95:2499-2504 (2000), Naldini et al, "The 
Development of Human Gene Therapy", Cold Spring Harbor Laboratory 
Press, pages 47-60 (1999)). It is important to note that, for both lentivirus and 
traditional retroviruses, no gene expression occurs until after the viral RNA 
genome has been reverse transcribed and integrated into the host genome. 

[0146] Similar to other retrovirus expression systems, the packaging functions 

of HIV can be supplied in trans, allowing the creation of lentiviral vectors for 
gene delivery. With all the viral proteins removed, the gene delivery vector 
becomes safe to use and allows foreign DNA to be efficiently packaged. In 
addition, it has been shown that lentiviral (or any retroviral) envelope proteins 
can be substituted for ones with broader tropism. The substitution of envelope 
is called pseudotyping, and allows creation of lentiviral vectors capable of 
infecting a wider variety of cells besides just CD4+ cells. Many have found 
that the G protein from vesicular stomatitis virus (VSV-G) is an excellent 
pseudotyping envelope protein that imparts a very broad host range for the 
virus (Yee et al, Proa Natl Acad ScL USA 97:9564-9568 (1994)). The 
ability of pseudo-typed lentivirus to infect a broad range of non-dividing cells 
has led to its extensive use in animal gene delivery and gene therapy (Baek et 
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al, Hum. Gene Ther. 72:1551-8 (2001), Park et al, Mol Ther. 4:164-73 
(2001), Peng et al, Gene Ther. 5:1456-63 (2001)). 

The present invention also encompasses the use of adenoviral vectors, 
including but not limited to, a pAd/PL-DEST vector (Table 11, Figure 10, 
SEQ ID NO:7) and pAd/CMV/V5-DEST vector (Table 12, Figure 11, SEQ ID 
NO:8). Adenoviruses are non-enveloped viruses with a 36 kb DNA genome 
that encodes more than 30 proteins. At the ends of the genome are inverted 
terminal repeats (ITRs) of approximately 100-150 base pairs. A sequence of 
approximately 300 base pairs located next to the S'-ITR is required for 
packaging of the genome into the viral capsid. The genome as packaged in the 
virion has terminal proteins covalently attached to the ends of the linear 
genome. 

The genes encoded by the adenoviral genome are divided into early 
and late genes depending upon the timing of their expression relative to the 
replication of the viral DNA. The early genes are expressed from four regions 
of the adenoviral genome termed E1-E4 and are transcribed prior to onset of 
DNA replication. Multiple genes are transcribed from each region. Portions 
of the adenoviral genome may be deleted without affecting the infectivity of 
the deleted virus. The genes transcribed from regions El, E2, and E4 are 
essential for viral replication while those from the E3 region may be deleted 
without affecting replication. The genes from the essential regions can be 
supplied in trans to allow the propagation of a defective virus. For example, 
deletion of the El region of the adenoviral genome results in a virus that is 
replication defective. Viruses deleted in this region are grown on 293 cells 
that express the viral El genes from the genome of the cell. 
| In addition to permitting the construction of a safer, replication- 

defective viruses, deletion and complementation in trans of portions of the 
adenoviral genome and/or deletion of non-essential regions make space in the 
adenoviral genome for the insertion of heterologous DNA sequences. The 
packaging of viral DNA into a viral particle is size restricted with an upper 
limit of approximately 38 kb of DNA. In order to maximize the amount of 
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heterologous DNA that may be inserted and packaged, viruses have been 
constructed that lack all of the viral genome except the ITRs and packaging 
sequence (see, U.S. patent no. 6,228,646). All of the viral functions necessary 
for replication and packaging are provided in trans from a defective helper 
virus that is deleted in the packaging signal. 

The present invention also encompasses the use of herpes viruses (see, 
for example, U.S. patent no. 5,672,344, issued to Kelly, et al). The family 
Herpesviridae contains three subfamilies 1) alphaherpesvirinae, containing 
among others human herpesvirus 1; 2) betaherpesvirinae, containing the 
cytomegaloviruses; and 3) gammaherpesvirinae. Herpesviruses are enveloped 
DNA viruses. Herpesviruses form particles that are approximately spherical 
in shape and that contain one molecule of linear dsDNA and approximately 20 
structural proteins. Numerous herpesviruses have been isolated from a wide 
variety of hosts. For example, United Patent No. 6,121,043 issued to Cochran, 
et al describes recombinant herpesvirus of turkeys comprising a foreign DNA 
inserted into a non-essential region of the herpesvirus of turkeys genome; 
United States Patent No. 6,410,311 issued to Cochran, et al describes 
recombinant feline herpesvirus comprising a foreign DNA inserted into a 
region corresponding to a 3.0 kb EcoRI-Sall fragment of a feline herpesvirus 
genome, United States Patent No. 6,379,967 issued to Meredith, et al, 
describes herpesvirus saimiri, (HVS; a lymphotropic virus of squirrel 
monkeys) as a viral vector; and United States Patent No. 6,086,902 issued to 
Zamb, et al describes recombinant bovine herpesvirus type 1 vaccines. 

Herpesviruses have been used as vectors to deliver exogenous nucleic 
acid material to a host cell. In addition to the examples above, United States 
Patent No. 4,859,587, issued to Roizman describes recombinant herpes 
simplex viruses, vaccines and methods, United States Patent No. 5,998,208 
issued to Fraefel, et al, describes a helper virus-free herpesvirus vector 
packaging system, United States Patent No. 6,342,229 issued to O'Hare, et aL, 
describes herpesvirus particles comprising fusion protein and their preparation 
and use and United States Patent 6,319,703 issued to Speck describes 
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recombinant virus vectors that include a double mutant herpesvirus such as an 
herpes simplex virus-1 (HSV-1) mutant lacking the essential glycoprotein gH 
gene and having a mutation impairing the function of the gene product VP16. 

[0152] Suitable vectors for use in the present invention also include 

prokaryotic vectors such as pcDNA II, pSL301, pSE280, pSE380, pSE420, 
pTrcHisA, B, and C, pRSET A, B, and C (Invitrogen, Corp.), pGEMEX-1, 
and pGEMEX-2 (Promega, Inc.), the pET vectors (Novagen, Inc.), pTrc99A, 
pKK223-3, the pGEX vectors, pEZZ18, pRIT2T, and pMC1871 (Pharmacia, 
Inc.), pKK233-2 andpKK388-l (Clontech, Inc.), andpProEx-HT (Invitrogen, 
Corp.) and variants and derivatives thereof. Other vectors of interest include 
eukaryotic expression vectors such as pFastBac, pFastBacHT, 
pFastBacDUAL, pSFV, and pTet-Splice (Invitrogen), pEUK-Cl, pPUR, 
pMAM, pMAMneo, pBHOl, pBI121, pDR2, pCMVEBNA, and pYACneo 
(Clontech), pSVK3, pSVL, pMSG, pCHllO, and pKK232-8 (Pharmacia, Inc.), 
p3'SS, pXTl, pSG5, pPbac, pMbac, pMClneo, and pOG44 (Stratagene, Inc.), 
and pYES2, pAC360, pBlueBacffis A, B, and C, pVL1392, pBlueBacDJ, 
pCDM8, pcDNAl, pZeoSV, pcDNA3 pREP4, pCEP4, and pEBVHis 
(Invitrogen, Corp.) and variants or derivatives thereof. 

[0153] Other vectors suitable for use in the invention include pUC18, pUC19, 

pBlueScript, pSPORT, cosmids, phagemids, YAC's (yeast artificial 
chromosomes), BAC's (bacterial artificial chromosomes), PI (Escherichia coli 
N phage), pQE70, pQE60, pQE9 (quagan), pBS vectors, PhageScript vectors, 
BlueScript vectors, pNH8A pNH16A, pNH18A, pNH46A (Stratagene), 
pcDNA3 (Invitrogen), pGEX, pTrsfus, P Trc99A pET-5, pET-9, pKK223-3, 
pKK233-3, pDR540, pRTT5 (Pharmacia), pSPORTl, pSPORT2, 
pCMVSPORT2.0 and pSV-SPORTl (Invitrogen) and variants or derivatives 
thereof. 

[0154] Additional vectors of interest include pTrxFus, pThioHis, pLEX, 

pTrcHis, pTrcHis2, pRSET, pBlueBacffis2, pcDNA3.1/His, 
pcDNA3.1(-)/Myc-His, pSecTag, pEBVHis, pPIC9K, pPIC3.5K, pA0815, 
pPICZ, pPICZa, pGAPZ, pGAPZa, pBlueBac4.5, pBlueBacHis2, pMelBac, 
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pSinRe P 5, pSinffis, pIND, pIND(SPl), pVgRXR, pcDNA2.1, pYES2, 
pZErOl.l, pZErO-2.1, pCR-Blunt, pSE280, pSE380, pSE420, pVL1392, 
pVL1393, pCDM8, pcDNAl.l, pcDNAl.l/Amp, pcDNA3.1, pcDNA3.1/Zeo, 
pSe, SV2, pRc/CMV2, pRc/RSV, pREP4, pREP7, pREP8, pREP9, pREP 10, 
pCEP4, pEBVHis, pCR3.1, pCR2.1, pCR3.1-Uni, and pCRBac from 
frivitrogen; k ExCell, X gtll, pTrc99A, pKK223-3, pGEX-IXT, pGEX-2T, 
pGEX-2TK, pGEX-4T-l, pGEX-4T-2, pGEX-4T-3, pGEX-3X, pGEX-5X-l, 
pGEX-5X-2, pGEX-5X-3, pEZZ18, pRIT2T, pMC1871, pSVK3, pSVL, 
pMSG, pCHHO, pKK232-8, pSL1180, pNEO, and pUC4K from Pharmacia; 
pSCREEN-lb(+), pT7Blue(R), pT7Blue-2, pCITE-4abc(+), pOCUS-2, pTAg, 
pET-32LIC, pET-30LIC, pBAC-2q) LIC, pBACgus-2cp LIC, pT7Blue-2 LIC, 
pT7Blue-2, XSCREEN-1, XBlueSTAR, pET-3abcd, pET-7abc, pET9abcd, 
pETllabcd, pET12abc, pET-14b, pET-15b, pET-16b, pET-17b-pET-17xb, 
pET-19b, pET-20b(+), pET-21abcd(+), pET-22b(+), pET-23abcd(+), pET- 
24abcd(+), pET-25b(+), pET-26b(+), pET-27b(+), pET-28abc(+), pET- 
29abc(+), pET-30abc(+), pET-31b(+), pET-32abc(+), pET-33b(+), pBAC-1, 
pBACgus-1, pBAC4x-l, pBACgus4x-l, pBAC-3cp, pBACgus-2cp, 
pBACsurf-1, pig, Signal pig, pYX, Selecta Vecta-Neo, Selecta Vecta-Hyg, 
and Selecta Vecta-Gpt from Novagen; pLexA, pB42AD, pGBT9, pAS2-l, 
pGAD424, pACT2, pGAD GL, pGAD GH, pGADIO, pGilda, pEZM3, 
pEGFP, pEGFP-1, pEGFP-N, pEGFP-C, pEBFP, pGFPuv, pGFP, p6xffis- 
GFP, pSEAP2-Basic, pSEAP2-Contral, pSEAP2-Promoter, pSEAP2- 
Enhancer, ppgal-Basic, ppgal-Control, ppgal-Promoter, ppgal-Enhancer, 
pCMVp, pTet-Ofi^ pTet-On, pTK-Hyg, pRetro-Off, pRetro-On, pIRESlneo, 
pIRESlhyg, pLXSN, pLNCX, pLAPSN, pMAMneo, pMAMneo-CAT, 
pMAMheo-LUC, pPUR, pSV2neo, pYEX4T-l/2/3, pYEX-Sl, pBacPAK-His, 
pBacPAK8/9, pAcUW31, BacPAK6, pTripffix, XgtlO, Xgtll, pWE15, and 
MriplEx from Clontech; Lambda ZAP II, pBK-CMV, pBK-RSV, pBluescript 
H KS +/-, pBluescript H SK +/-, pAD-GAL4, pBD-GAL4 Cam, pSurfscript, 
Lambda FIX II, Lambda DASH, Lambda EMBL3, Lambda EMBL4, 
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SuperCos, pCR-Scrigt Amp, pCR-Script Cam, pCR-Script Direct, pBS +/-, 
pBC KS +/-, pBC SK +/-, Phagescript, pCAL-n-EK, pCAL-n, pCAL-c, 
pCAL-kc, pET-3abcd, pET-llabcd, pSPUTK, pESP-1, pCMVLacI, 
pOPRSVI/MCS, pOPI3 CAT,pXTl, pSG5, pPbac, pMbac, pMClneo, 
pMClneo Poly A, pOG44, pOG45, pFRTpGAL, pNEOpGAL, pRS403, 
pRS404, pRS405, pRS406, pRS413, pRS414, pRS415, and pRS416 from 
Stratagene. 

[0155] Two-hybrid and reverse two-hybrid vectors of interest include pPC86, 

pDBLeu, pDBTxp, pPC97, p2.5, pGADl-3, pGADIO, pACt, pACT2, 
pGADGL, pGADGH, pAS2-l, pGAD424, pGBT8, pGBT9, pGAD-GAL4, 
pLexA, pBD-GAL4, pHISi, pfflSi-1, placZi, pB42AD, pDG202, pJK202, 
pJG4-5, pNLexA, pYESTrp and variants or derivatives thereof. 

[0156] The present invention also embodies the use and production of 

chimeric vectors. Such chimeric vectors may comprise one or more sequences 
that encode one or more functional or structural component of a viral vector, 
wherein each component may or may not come from the same or different 
types of viruses. Suitable components that may be combined to create such a 
chimeric vector include, but are not limited to, gag, pol, env, and rev genes 
and capsid proteins. 

[0157] The nucleic acid molecules produced and/or utilized in the cloning 

methods, compositions and kits of the present invention may additionally or 
alternatively comprise one or more promoter molecules as described 
throughout the present specification, including the Pol HI promoters HI and 
U6 as well as other promoters recognized by RNA polymerase HI. The 
nucleic acid molecules and vectors of the present invention may also further or 
alternatively comprise one or more genes which code for signal peptides 
and/or protease cleavage sites. Examples of protease cleavage sites include, 
but are not limited to, TEV sites and EK sites. TEV cleavage sites useful in 
the present invention include: 

Consensus sequence: Glu-Xaa-Xaa-Try-Xaa-Gln/ZXaa 1 (SEQ ID NO:23) 
TEV1: Glu-Asn-Leu-Try-Phe-Gln/ZXaa 1 (SEQIDNO:24) 
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TEV2: Glu-Thr-Leu-Tyr-nue-GW/Xaa 1 (SEQ ID NO:25) 
(Xaa = any amino acid; Xaa 1 = any amino acid, except Pro; // = cleavage site). 
[0158] EK cleavage sites useful in the present invention include: 

Asp-Asp-Asp-Asp-Lys// (SEQIDNO:26) 

(// = cleavage site). 

[0159] Signal peptides utilized in the present invention may be removed by a 

signal peptidase or any protease (e.g. Precision, thrombin and factor X) 
specific for one or more motifs on a signal peptide to generate a mature 
protein, including a protein encoded only by the inserted nucleic acid. The 
present invention also encompasses methods for the production of fusion 
proteins, and the fusion proteins produced by those methods. In accordance 
with the present invention, the proteins of the present invention may comprise 
one or more signal peptides, or portions of signal peptides, as noted above. 
These signal peptides may be used to facilitate production of desired proteins 
(e.g. mature or native proteins) in vivo or in vitro. Proteins produced using the 
methods of the present invention comprising such signal peptides would allow 
for the production of mature proteins, in which proteins are exported from the 
cell upon cleavage of the signal peptide by proteases within the cell. In an in 
vitro setting, these signal peptides would facilitate the production of native or 
desired proteins outside of a cell. Cleavage of the signal peptide may occur 
using signal peptidases, such as those described above, thus producing a 
desired protein product. These signal peptides may also be used as tags to 
facilitate affinity purification of polypeptides or proteins, for example fusion 
polypeptides or fusion proteins, produced by the methods of the present 
invention. 

[0160] Any number of different protease recognition sites may be used in the 

practice of the invention. These sites will often be selected by to fit particular 
criteria suitable for the specific application. Exemplary proteases and protease 
recognition sites include the following. Tobacco Etch Virus (TEV) protease 
recognizes the amino acid sequence Glu-Xaa-Xaa-Tyr-Xaa-GlnyVXaa 1 (SEQ 
ID NO:23), where Xaa is any amino acid; Xaa 1 is any amino acid except Pro 
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and // indicates the cleavage site. Thus, for the amino acid sequence Glu-Asn- 
Leu-Tyr-Phe-Gln-Gly (SEQ ID NO:27), TEV cleaves between the Gin and 
Gly residues (see Invitrogen product literature associated with cat nos. 10127- 
017 and 12575-015). Also, for the amino acid sequence Glu-Thr-Leu-Tyr-De- 
Gln-Xaa 1 (SEQ ID NO:25), TEV cleaves between the Gin and Xaa residues. 
Enterokinase (EK) recognizes the amino acid sequence Asp-Asp-Asp-Asp-Lys 
(SEQ ID NO:26) cleaves after the lysine (see Invitrogen product literature 
associated with cat nos. El 80-01 and El 80-02, Invitrogen Corp., Carlsbad, 
CA). The ulpl protease recognizes the amino acid sequence Gly-Gly-Ser 
(SEQ ID NO:28) and cleaves between the second glycine and the serine (U.S. 
Patent Publication No. 2003/0086918). Thus, the invention provides and 
includes nucleic acid molecules which may be used for producing proteins 
which may be processed by TEV protease, EK and/or ulpl protease to 
generate proteins, as well as methods employing these enzymes and proteins 
or peptides produced using these methods. 

[0161] In instances where the protein or peptide which is desired contains an 

amino terminal glycine, an amino terminal tag comprising and/or ending in a 
TEV protease recognition sequence may be used to generate a protein or 
peptides which contains no amino acids associated with, for example, cloning 
sites. Similarly, in instances where the protein which is desired contains an 
amino terminal serine, an amino terminal tag comprising and/or ending in a 
ulp protease recognition sequence may be used to generate a protein or peptide 
which contains amino acids associated with, for example, cloning sites. EK 
may be used to generate proteins or peptides which have an amino terminus 
other than glycine, as well as glycine. 

[0162] The present invention also includes methods for joining two or more 

nucleic acid molecules using methods, for example, described elsewhere 
herein, wherein a first nucleic acid molecule contains a region which encodes 
a protease cleavage site and, optionally, a tag with a second nucleic acid 
molecule encodes a desired protein or peptide. In many instances, these 
nucleic acid segments are connected such that the desired protein is expressed 
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along with amino acids of the protease cleavage site as a fusion protein such 
that upon processing with the cognate protease, the desired protein is 
produced. Often, the desired protein which results from proteolytic digestion 
will contain only amino acids encoded by the second nucleic acid molecule 
referred to above, 

[0163] In many instances, when a desired protein is produced from a nucleic 

acid formed by the connection of two nucleic acid molecules, the generation 
of a "seam" is only relevant with respect to one end of the protein (i.e., the 
amino terminus or the carboxy terminus). In other words, in instances, where 
there is, for example, an amino terminal tag or a carboxy terminal tag, but not 
both, there is only a need to remove one tag. For example, when the 
translation product contains an amino terminal tag, the carboxy terminus of the 
translation product will typically terminate at a position in the mRNA which 
corresponds to the naturally resident stop codon. In such instances, a protease 
system may be used which will only amino terminal amino acids from the 
translation product. 

[0164] The present invention also encompasses the production of a protein 

that comprises an expression enhancing amino acid seiquence cleavable by ulpl 
protease or an active fragment of ulpl protease (for example the fragment from 
amino acid positions 403 to 621) and a poly-amino acid of interest, 
particularly one that is difficult to express in a recombinant expression system. 
The protein may also include a purification tag for ease of isolation. The ulpl 
protease cleavable site may be any ulpl cleavable site, such as for example a 
ulpl protease cleavable site from a ubiquitin-like protein e. g. a SUMO (small 
ubiquitin-like molecule). The SUMO may be, for instance, Smt3 from yeast, 
or a fragment of Smt3 that retains the ability to be recognized and cleaved by 
Ulp 1. Examples of such a fragment of Smt3 include the fragment from amino 
acid positions 14-98 of Smt3 and the fragment from amino acid positions 1-98 
of Smt3. Examples of such proteins can be found in WO 02/090495, the 
entire disclosure of which is incorporated herein by reference. 
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When nucleic acid molecules and/or methods of the invention are used 
to produce proteins or peptides, these proteins or peptides may be produced 
with an amino terminal and/or carboxy terminal tag. These tags may be used 
for any number of purposes, including to (1) increase the stability of the 
protein or peptide or (2) allow for purification. Thus, proteins or peptides 
produced by methods of the invention, as well as protein or peptides encoded 
by nucleic acid molecules of the invention, may contain affinity purification 
tags (e.g., epitope tags such as the V5 epitope). Affinity purification tags are 
often amino acid sequences that can interact with a binding partner 
immobilized on a solid support. Nucleic acids encoding multiple consecutive 
single amino acids, such as histidine, may be used for one-step purification of 
the recombinant protein by affinity binding to a resin column, such as nickel 
sepharose. A protease cleavage site can be engineered between the affinity tag 
and the desired protein to allow for removal of the tag, for example, after the 
purification process is complete or to induce release of the desired protein or 
peptide from the solid support. Affinity tags which may be used in the 
practice of the invention include tags such as the chitin binding domain (which 
binds to chitin), polyarginine, glutathione-S-transferase (which binds to 
glutathione), maltose binding protein (which binds maltose), FlAsH, biotin 
(which binds to avidin and strepavidin), and the like. 

Epitope tags are short amino acid sequences which are recognized by 
epitope specific antibodies. Proteins or peptides which contain one or more 
epitope tags may purified, for example, using a cognate antibody bound to a 
chromatography resin. The presence of the epitope tag furthermore allows the 
recombinant protein to be detected in subsequent assays, such as Western 
blots, without having to produce an antibody specific for the recombinant 
protein itself. Examples of commonly used epitope tags include V5, 
glutathione-S-transferase (GST), hemaglutinin (HA), the peptide Phe-His-His- 
Thr-Thr (SEQ ID NO:29), chitin binding domain, and the like. As discussed 
above, these affinity tags may be removed from the desired protein or peptide 
by proteolytic cleavage. 
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[0167] FlAsH tags comprise the sequence a cys-cys-Xaa-Xaa-cys-cys (SEQ 

ID NO:30), where Xaa and Xaa are amino acids. In many instances, Xaa and 
Xaa, which may be the same or different amino acids, are amino acids with 
high a-helical propensity. In some embodiments, X and Y are the same amino 
acid. These peptides have been shown to bind to biarsenical compounds. The 
FlAsH systems is described in U.S. Patent No. 6,054,271, the entire disclosure 
of which is incorporated herein by reference. 

[0168] The nucleic acid molecules and/or nucleic acid segments utilized in the 

cloning methods, compositions and kits of the present invention may 
optionally comprise one or more selectable markers comprising at least one 
DNA segment encoding an element selected from the group consisting of an 
antibiotic resistance gene, a gene that encodes a fluorescent protein, a tRNA 
gene, an auxotrophic marker, a toxic gene, a phenotypic marker, an antisense 
oligonucleotide, a restriction endonuclease, a restriction endonuclease 
cleavage site, an enzyme cleavage site, a protein binding site, and a sequence 
complementary to a PCR primer sequence. 

[0169] Suitable antibiotic resistance genes for use in the present invention are 

well known in the art and include, but are not limited to, chloramphenicol 
resistance genes, ampicillin resistance genes, tetracycline resistance genes, 
Zeocin resistance genes, spectinomycin resistance genes and kanamycin 
resistance genes. 

[0170] Examples of toxic gene products suitable for use in the present 

invention are well known in the art, and include, but are not limited to, 
restriction endonucleases (e.g., Dpnl), apoptosis-related genes (e.g. ASK1 or 
members of the bcl-2/ced-9 family), retroviral genes including those of the 
human immunodeficiency virus (HIV), defensins such as NP-1, inverted 
repeats or paired palindromic nucleic acid sequences, bacteriophage lytic 
genes such as those from (<DX174 or bacteriophage T4; antibiotic sensitivity 
genes such as rpsL, antimicrobial sensitivity genes such as pheS, plasmid 
killer genes, eukaryotic transcriptional vector genes that produce a gene 
product toxic to bacteria, such as GATA-1, and genes that kill hosts in the 
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absence of a suppressing function, e.g., kicB, sacB, ccdB, (OX174 E (Liu, Q. 
et al, Curr. Biol 5:1300-1309 (1998)), and other genes that negatively affect 
replicon stability and/or replication. The present invention also encompasses 
the use of a gene that encodes the tus gene which binds to one or more ter 
sites. A toxic gene can alternatively be selectable in vitro, e.g., a restriction 
site. 

[0171] Any of the nucleic acid molecules or nucleic acid segments used in or 

produced by the present methods, compositions and kits may further comprise 
one or more site-specific recombination sites. These recombination sites may 
flank the one or more restriction sites {e.g. one or more type lis sites) if 
present in the nucleic acid molecules or segments of the invention. Site- 
specific recombinases are proteins that are present in or produced by many 
organisms (e.g., viruses and bacteria) and have been characterized as having 
both endonuclease and ligase properties. These recombinases (along with 
associated proteins in some cases) recognize specific sequences of bases (i.e., 
recombination sites) in a nucleic acid molecule and exchange the nucleic acid 
segments flanking those sequences. The recombinases and associated proteins 
are collectively referred to as "recombination proteins" (see, e.g., Landy, A., 
Current Opinion in Biotechnology 5:699-707 (1993)). 

[0172] Numerous recombination systems from various organisms have been 

described. See, e.g., Hoess, et al, Nucleic Acids Research 74:2287 (1986); 
Abremski, et al, J. Biol Chem. 261391 (1986); Campbell, J. Bacteriol 
174:7495 (1992); Qian, et al, J. Biol Chem. 267:7794 (1992); Araki, et al, J. 
Mol Biol 225:25 (1992); Maeser and Kahnmann, Mol Gen. Genet. 230:170- 
176) (1991); Esposito, et al, Nucl Acids Res. 25:3605 (1997). Many of these 
belong to the integrase family of recombinases (Argos, et al, EMBO J. 5:433- 
440 (1986); Voziyanov, et al, Nucl Acids Res. 27:930 (1999)). Perhaps the 
best studied of these are the Integrase/att system from bacteriophage ((Landy, 
A. Current Opinions in Genetics and Devel 3:699-707 (1993)), the Cre//o*P 
system from bacteriophage PI (Hoess and Abremski (1990) In Nucleic Acids 
and Molecular Biology, vol. 4. Eds.: Eckstein and Lilley, Berlin-Heidelberg: 
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Springer-Verlag; pp. 90-109), and the FLP/FRT system from the 
Saccharomyces cerevisiae 2 \i circle plasmid (Broach, et al., Cell 29:227-234 
(1982)). 

[0173] Recombination sites are sections or segments of nucleic acid on the 

participating nucleic acid molecules that are recognized and bound by the 
recombination proteins during the initial stages of integration or 
recombination. For example, the recombination site for Cre recombinase is 
lox? which is a 34 base pair sequence comprised of two 13 base pair inverted 
repeats (serving as the recombinase binding sites) fla nk i n g an 8 base pair core 
sequence. See Figure 1 of Sauer, B., Curr. Opin. Biotech. 5:521-527 (1994). 
Other examples of recognition sequences include the attB, atiP, attL, and attR 
sequences which are recognized by the recombination protein Int. attB is an 
approximately 25 base pair sequence containing two 9 base pair core-type Int 
binding sites and a 7 base pair overlap region, while attP is an approximately 
240 base pair sequence containing core-type Int binding sites and arm-type Int 
binding sites as well as sites for auxiliary proteins integration host factor 
(IHF), FIS and excisionase (Xis). SeeLandy, Curr, Opin. Biotech 3:699-707 
(1993). Suitable recombination sites for use in the present invention include, 
but are not limited to, attB sites, att? sites, atth sites, attR sites, lox sites, psi 
sites, tfipl sites, dif sites, cer sites, fit sites, and mutants, variants and 
derivatives thereof. 

[0174] The present cloning methods also embody the use of nucleic acid 

molecules that include a DNA segment having one or more terminal 3'- 
deoxyadenosine monphosphate (dAMP) residues, as described in US Patent 
No. 5,487,933, herein incorporated entirely by reference. These DNA 
segments are generated by thermophilic polymerases during PCR 
amplification. Double-stranded nucleic acids are formed with a single 
overhanging 3'-AMP residue. Mixture of these molecules with a population 
of linear double-stranded DNA molecules with a single overhanging 
deoxythymidylate (dTMP) residue at one or both of the 3' termini of the DNA 
molecule allow for ligation of the 3'-dAMP containing nucleic acid molecules 
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and the 3'-dTMP-containing DNA molecules to produce recombinant 
molecules. This approach is commonly known to those in the art as "TA 
Cloning," compositions and methods for which are available from Invitrogen 
Corporation (Carlsbad, CA). 

The present invention also encompasses the use of cloning methods 
known to those skilled in the art as RecA cloning. The RecA cloning protein 
efficiently coats singly-stranded DNA. In the presence of ATP, this Rec-A 
coated single-stranded DNA can for triple-stranded nucleoprotein complexes 
with homologous double-stranded DNA. This RecA driven strand invasion 
and annealing can lead to high efficiency capture of DNA containing regions 
of homology with single-stranded DNA probes. This system can be used to 
increase the efficiency of recombination between a circular plasmid DNA 
molecule and a linear DNA "insert." Such suitable methods of RecA cloning 
can be found in U.S. Patent Nos. 5,948,653, 6,074,853 and 6,200,812, the 
disclosures of each of which are hereby incorporated entirely by reference. 

The present invention also encompasses the use of a method of cloning 
DNA molecules in cells comprising the steps: a) providing a host cell capable 
of performing homologous recombination, b) contacting in said host cell a first 
DNA molecule which is capable of being replicated in said host cell with a 
second DNA molecule comprising at least two regions of sequence homology 
to regions on the first DNA molecule, under conditions which favour 
homologous recombination between said first and second DNA molecules and 
c) selecting a host cell in which homologous recombination between said first 
and second DNA molecules has occurred. 
[0177] In this method of the present invention, the homologous recombination 

suitably occurs via the recET mechanism, i.e. the homologous recombination 
is mediated by the gene products of the recE and the recT genes which are 
preferably selected from the E. coli genes recE and recT or functionally 
related genes such as the phage % reda and redp genes. In contrast to RecA 
cloning, the recET cloning system requires significantly fewer bases of 
homology for efficient recombination into the target molecule. These proteins 
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facilitate the homologous incorporation of a double-stranded DNA fragment 
into a circular plasmid. 

[0178] A host cell suitable for this embodiment of the present invention is a 

bacterial cell, e.g. a gram-negative bacterial cell. Suitably the host cell is an 
enterobacterial cell, such as Salmonella, Klebsielia or Escherichia. Most 
preferably the host cell is an Escherichia coli cell. It should be noted, 
however, that this method of the present invention is also suitable for 
eukaryotic cells, such a s fungi, plant or animal cells. Such suitable methods 
of recET cloning can be found in Zhang, Y. et al, Nature 20:123-128 (1998), 
Muryers, J.P.P., et al, Nucl. Acids Res. 27:1555-1557 (1999), and U.S. Patent 
Nos. 6,509,156 and 6,355,412, the disclosures of each of which are hereby 
incorporated entirely by reference. 

[0179] The first nucleic acid molecule and/or segment, as well as the second 

nucleic acid molecule involved in the methods, compositions and kits of the 
present invention may further or alternatively comprise one or more 
topoisomerase recognition sites and/or one or more topoisomerases. In 
suitable embodiments, the topoisomerase recognition site(s), if present, may 
optionally be flanked by two or more recombination sites. 

[0180] The term "flanked" as used herein is meant to indicate a spatial 

relationship wherein a restriction site (eg. a type lis site) and/or 
recombination site are located to one side of a nucleic acid segment (gene, 
selectable marker, etc.). As described above, recombination sites may also 
flank restriction sites (e.g. type lis sites) utilized in the invention. In the 
situation where a nucleic acid segment is flanked by two or more 
recombination or recognition sites, each side of the nucleic acid segment may 
be flanked by one or more sites. 

[0181] Topoisomerases are categorized as type I, including type IA and type 

IB topoisomerases, which cleave a single strand of a double stranded nucleic 
acid molecule, and type II topoisomerases (gyrases), which cleave both strands 
of a nucleic acid molecule. Type IA and IB topoisomerases cleave one strand 
of a nucleic acid molecule. Cleavage of a nucleic acid molecule by type IA 
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topoisomerases generates a 5* phosphate and a 3' hydroxyl at the cleavage site, 
with the type IA topoisomerase covalently binding to the 5' terminus of a 
cleaved strand, hi comparison, cleavage of a nucleic acid molecule by type IB 
topoisomerases generates a 3' phosphate and a 5 ! hydroxyl at the cleavage site, 
with the type IB topoisomerase covalently binding to the 3 f terminus of a 
cleaved strand The topoisomerase recognition sites of the present invention, 
if present, may be recognized and bound by a type I topoisomerase, and 
suitably by a type IB topoisomerase. Type IB topoisomerases useful in the 
present invention include, but are not limited to eukaryotic nuclear type I 
topoisomerase and a poxvirus topoisomerase. The poxvirus topoisomerase 
useful in the present invention may be produced by or isolated from a virus 
including, but not limited to, vaccinia virus, Shope fibroma virus, OKF virus, 
fowlpox virus, molluscum contagiosum virus and Ainsacta morrei 
entomopoxvirus (see Shuman, Biochim. Biophys. Acta 1400:321-331 1998; 
Petersen et al., Virology 230:197-206, 1997; Shuman and Prescott, Proc. Natl 
Acad. Set, USA 54:7478-7482, 1987; Shuman, J. Biol Chem. 269:3267%- 
32684, 1994; U.S. Pat. No. 5,766,891; PCT/US95/16099; PCT/US98/12372,, 
each of which is incorporated herein by reference; see, also, Cheng et al, 
supra, 1998). Suitable type IB topoisomerases include the nuclear type I 
topoisomerases present in all eukaryotic cells and those encoded by vaccinia 
and other cellular poxviruses (see Cheng et al, Cell 92:841-850, 1998, which 
is incorporated herein by reference). The eukaryotic type IB topoisomerases 
are exemplified by those expressed in yeast, Drosophila and mammalian cells, 
including human cells (see Caron and Wang, Adv. Pharmacol 2P£,:27 1-297, 
1994; Gupta et al, Biochim. Biophys. Acta 1262:1-14, 1995, each of which is 
incorporated herein by reference; see, also, Berger, supra, 1998). 
[0182] In suitable aspects of the present invention, the one or more optional 

selectable markers of the nucleic acids or segment used in or produced by the 
present invention may be flanked by one or more restriction sites (e.g. one or 
more type lis sites) and/or one or more recombination sites. 
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In other suitable embodiments of the present invention, the first nucleic 
acid molecule or segment and/or the second nucleic acid molecule may not 
comprise a promoter. The present invention allows for transfer of a promoter 
element into a second nucleic acid molecule that may not comprise a promoter 
via seamless cloning. In this orientation, transcription of the second nucleic 
acid molecule from the promoter element located on the first nucleic acid 
molecule may proceed such that no, additional sequences are transcribed 
between the promoter element and the start codon of the second nucleic acid 
molecule. The present invention also allows for seamlessly adding a first 
nucleic acid molecule or segment into a second nucleic molecule that contains 
a promoter element such that the first nucleic acid molecule or segment will 
subsequently be under the control of the promoter element. 

Incubation conditions suitable for use in the methods of the present 
invention comprise incubation with sufficient amounts of DNA ligases and 
buffers. Such incubation conditions are described in Maniatis et al 9 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1982). The term sufficient amount as used 
herein means that the amount of DNA ligase(s) and buffer(s) present during 
the cloning and/or recombination reactions is such that these reactions proceed 
as designed. Suitable buffers include physiologic buffers such as, but not 
limited to, Tris-(hydroxymethyl)aminomethane-HCl TRIS®-HC1, Ethylene- 
diaminetetraacetic acid (EDTA) disodium salt, saline, Phosphate Buffered 
Saline (PBS), N<2-Hydroxyethyl)piperazine-N , -(2-ethanesulfonic acid) 
(HEPES®), 3-(N-Morpholino)propanesulfonic acid (MOPS), 2-bis(2- 
Hydroxyetliylene)amino-2-(hydroxymethyl)-l ,3-propanediol (bis-TRIS®), 
potassium phosphate (KP), sodium phosphate (NaP), dibasic sodium 
phosphate (Na 2 HP0 4 ), monobasic sodium phosphate (NaH 2 P04), monobasic 
sodium potassium phosphate (NaKHP0 4 )> magnesium phosphate 
(Mg3(P0 4 )2-4H 2 0), potassium acetate (CH 3 COOH), D(+)-cc-sodium 
glycerophosphate (HOCH 2 CH(OH)CH 2 OP0 3 Na2) and other physiologic 
buffers known to those skilled in the art. 
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In additional embodiments of the present invention provides methods 
for cloning or subcloning one or more desired nucleic acid molecules 
comprising: (a) combining in vitro or in vivo (i) one or more first nucleic acid 
molecules comprising one or more sticky ends generated by one or more first 
restriction enzymes (e.g. one or more type lis restriction enzymes); (ii) one or 
more second nucleic acid molecules comprising one or more toxic genes 
flanked by one or more second restriction sites (e.g. one or more type lis 
restriction enzyme recognition sites); and (iii) one or more restriction enzymes 
(e.g. one or more type lis restriction enzymes) that are specific for the first 
and/or second restriction sites; and (b) incubating the combination under 
conditions sufficient to join the first nucleic acid molecule and one or more of 
the second nucleic acid molecules, thereby producing one or more desired 
product nucleic acid molecules. Cloning via such methods of the invention 
allows for selection of successfully cloned nucleic acid molecules where the 
toxic gene originally present in the second nucleic acid molecule has been 
removed and replaced with a desired nucleic acid sequence from the first 
nucleic acid molecule. 

In other embodiments of the present invention provides methods for 
cloning or subcloning one or more desired nucleic acid molecules, or portions 
thereof, comprising: (a) combining in vitro or in vivo (i) one or more first 
nucleic acid molecules comprising at least one nucleic acid segment that is 
flanked by one or more first restriction sites (e.g. one or more type lis 
restriction enzyme recognition sites); (ii) one or more second nucleic acid 
molecules comprising one or more toxic genes flanked by one or more second 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
and (iii) one or more restriction enzymes (e.g. one or more type lis restriction 
enzymes) that are specific for the first and/or second restriction enzyme 
recognition sites; and (b) incubating the combination under conditions 
sufficient to join the first nucleic acid molecule and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. As noted above, cloning via such methods of the invention 
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allows for selection of successfully cloned nucleic acid molecules where the 
toxic gene originally present in the second nucleic acid molecule has been 
removed and replaced with a desired nucleic acid sequence from the first 
nucleic acid molecule. 
[0187] The present invention also provides methods for cloning or subcloning 

one or more desired nucleic acid molecules comprising: (a) combining in vitro 
or in vivo (i) one or more first nucleic acid molecules comprising one or more 
sticky ends that have been generated by one or more first restriction enzymes 
(e.g. one or more type lis restriction enzymes); (ii) one or more second nucleic 
acid molecules comprising one or more toxic genes and one or more antibiotic 
resistance genes all flanked by one or more second restriction sites (e.g. one or 
more type lis restriction enzyme recognition sites); and (iii) one or more 
restriction enzymes (e.g. one or more type lis restriction enzymes) that are 
specific for the restriction enzyme recognition sites; and (b) incubating said 
combination under conditions sufficient to join the first nucleic acid molecule 
into and or more of the second nucleic acid molecules, thereby producing one 
or more desired product nucleic acid molecules. This embodiment allows for 
additional selective screening via selection, for example, of antibiotic resistant 
host cells. 

[0188] The present invention also provides methods for cloning or subcloning 

one or more desired nucleic acid molecules, or portions thereof, comprising: 
(a) combining in vitro or in vivo (i) one or more first nucleic acid molecules 
comprising at least one nucleic acid segment flanked by one or more first 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
(ii) one or more second nucleic acid molecules comprising one or more toxic 
genes and one or more antibiotic resistance genes all flanked by one or more 
second restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites); and (iii) one or more restriction enzymes (e.g. one or more 
type lis restriction enzymes) that are specific for the restriction enzyme 
recognition sites; and (b) incubating said combination under conditions 
sufficient to join the first nucleic acid molecule and one or more of the second 
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nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules. This embodiment allows for additional selective screening via 
selection, for example, of antibiotic resistant host cells. 

Another embodiment of the invention provides a method for cloning or 
subcloning one or more desired nucleic acid molecules comprising: (a) 
combining in vitro or in vivo (i) one or more first nucleic acid molecules 
comprising one or more sticky ends that have been generated by one or more 
first restriction enzymes (e.g. one or more type lis restriction enzymes); (ii) 
one or more second nucleic acid molecules comprising one or more second 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites) 
flanked by one or more recombination sites; and (iii) one or more restriction 
enzymes (e.g. one or more type Us restriction enzymes) that are specific for 
the first and/or second restriction enzyme recognition sites; and (b) incubating 
said combination under conditions sufficient to join the first nucleic acid 
molecule and one or more of said second nucleic acid molecules, thereby 
producing one or more desired product nucleic acid molecules. Following 
cloning of the first nucleic acid molecule, the cloned portion of the sequence 
may be cloned into another nucleic acid molecule via, for example, 
recombination cloning as described below. 

| Another embodiment of the invention provides a method for cloning or 

subcloning one or more desired nucleic acid molecules, or portions thereof, 
comprising: (a) combining in vitro or in vivo (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment flanked by one or 
more first restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more second restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites) flanked by one or more recombination sites; and 
(iii) one or more restriction enzymes (e.g. one or more type lis restriction 
enzymes) that are specific for the first and/or second restriction enzyme 
recognition sites; and (b) incubating said combination under conditions 
sufficient to join the first nucleic acid molecule and one or more of said 
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second nucleic acid molecules, thereby producing one or more desired product 
nucleic acid molecules. As noted above, following cloning of the first nucleic 
acid molecule, the cloned portion of the sequence may be cloned into another 
nucleic acid molecule via, for example, recombination cloning as described 
below. 

[0191] The present invention also provides for a method for cloning or 

subcloning one or more desired nucleic acid molecules, or portions thereof, 
comprising: (a) combining in vitro or in vivo (i) one or more first nucleic acid 
molecules comprising at least one nucleic acid segment flanked by one or 
more first restriction sites (e.g. one or more type lis restriction enzyme 
recognition sites) and further flanked by one or more recombination sites; (ii) 
one or more second nucleic acid molecules comprising one or more 
recombination sites; and (iii) one or more site-specific recombination proteins; 
and (b) incubating the combination under conditions sufficient to transfer the 
first nucleic acid molecule into one or more of the second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules. 

[0192] This method of the present invention allows for the transfer of a 

nucleic acid sequence flanked by one or more restriction sites (e.g. one or 
more type lis sites) that is further flanked by one or more recombination sites 
into a second nucleic acid molecule via recombinational cloning. 
Recombinational cloning is described in detail in U.S. Patent Nos. 5,888,732 
and 6,277,608 (incorporated herein entirely by reference in their entireties). 
Recombinational cloning as disclosed in U.S. Patent Nos. 5,888,732 and 
6,277,608 describes methods for moving or exchanging nucleic acid segments 
using at least one recombination site and at least one recombination protein to 
provide chimeric DNA molecules. Suitable recombination proteins for use in 
the present invention include, but are not limited to Int, Cre, IHF, Xis, Fis, 
Hin, Gin, Cin, Tn3 resolvase, TndX, XerC and XerD. 

[0193] The methods of the present invention may further comprise 

introducing the product nucleic acid into one or more host cells. Host cells 
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that may be used in any aspect of the present invention include, but are not 
limited to, bacterial cells, yeast cells, plant cells and animal cells. Preferred 
bacterial host cells include Escherichia spp. cells (particularly coli cells and 
most particularly £. coli strains DH10B, Stbl2, DH5, DB3 (deposit No. NRRL 
B-30098), DB3.1 (preferably E. coli LIBRARY EFFICIENCY7 DB3.U 
Competent Cells; Invitrogen Corporation, Carlsbad, CA), DB4 and DBS 
(deposit Nos. NRRL B-30106 and NNRL B-30107 respectively, see U.S. 
Application No. 09/518,188, filed March 2, 2000, the disclosure of which is 
incorporated by reference herein in its entirety), JDP682 and ccrfA-over (See 
U.S. Provisional Application No 60/475,004, filed June 3, 2003, the disclosure 
of which is incorporated by reference herein in its entirety), Bacillus spp. cells 
(particularly B. subtilis and £. megaterium cells), Streptomyces spp. cells, 
Erwinia spp. cells, Klebsiella spp. cells, Serratia spp. cells (particularly S. 
marcessans cells), Pseudomonas spp. cells (particularly P. aeruginosa cells), 
and Salmonella spp. cells (particularly S. typhimurium and S. typhi cells). 
Preferred animal host cells include insect cells (most particularly Drosophila 
melanogaster cells, Spodoptera frugiperda Sf9 and S£21 cells and Trichoplusa 
High-Five cells), nematode cells (particularly C. elegans cells), avian cells, 
amphibian cells (particularly Xenopus laevis cells), reptilian cells, and 
mammalian cells (most particularly NIH3T3, CHO, COS, VERO, BHK and 
human cells). Preferred yeast host cells include Saccharomyces cerevisiae 
cells and Pichia pastoris cells. These and other suitable host cells are available 
commercially, for example from Invitrogen Corporation (Carlsbad, 
California), American Type Culture Collection (Manassas, Virginia), and 
Agricultural Research Culture Collection (NRRL; Peoria, Illinois). 
[0194] Additional host cells that are useful in the present invention include 

mutant host cells and host cell strains, as well as mutants and/or derivatives 
thereof, that are resistant to the effects of the expression of one or more toxic 
genes. Host cells of this type may, for example, comprise one or more 
mutations in one or more genes within their genomes or on extrachromosomal 
or extragenomic DNA molecules (such as plasmids, phagemids, cosmids, 
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etc.), including mutations in, for example, recA, endA y mcrA, mcrB, mcrC, 
hsd, dedR, tonA, and the like, in particular in recA or endA or in both recA 
and endA. The mutations to these host cells may render the host cells and host 
cell strains resistant to toxic genes including, but not limited to, ccdB> kicB, 
sacB, Dpnl, an apoptosis-related gene, a retroviral gene, a defensin, a 
bacteriophage lytic gene, an antibiotic sensitivity gene, an antimicrobial 
sensitivity gene, a plasmid killer gene, and a eukaryotic transcriptional vector 
gene that produces a gene product toxic to bacteria, and most particularly 
ccdB. Production and use of these type of mutant host cell strains are 
described in commonly owned U.S. Appl. Nos. 60/122,392, filed March 2, 
1999, 09/518,188, filed March 2, 2000 (now abandoned), 10/396,696, filed 
March 20, 2003, and 60/475,004, filed June 3, 2003, the disclosures of which 
are incorporated herein by reference in their entireties. 
[0195] Methods for introducing the cloned product nucleic acid molecules 

and/or vectors of the invention into the host cells described herein, to produce 
host cells comprising one or more of the cloned nucleic acid molecules and/or 
vectors of the invention, will be familiar to those of ordinary skill in the art. 
For instance, the nucleic acid molecules and/or vectors of the invention may 
be introduced into host cells using well known techniques of infection, 
transduction, electroporation, transfection, and transformation. The nucleic 
acid molecules and/or vectors of the invention may be introduced alone or in 
conjunction with other the nucleic acid molecules and/or vectors and/or 
proteins, peptides or RNAs. Alternatively, the nucleic acid molecules and/or 
vectors of the invention may be introduced into host cells as a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a lipid. 
Electroporation also may be used to introduce the nucleic acid molecules 
and/or vectors of the invention into a host. Likewise, such molecules may be 
introduced into chemically competent cells such as E. coli. If the vector is a 
virus, it may be packaged in vitro or introduced into a packaging cell and the 
packaged virus may be transduced into cells. Hence, a wide variety of 
techniques suitable for introducing the nucleic acid molecules and/or vectors 
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of the invention into cells in accordance with this aspect of the invention are 
well known and routine to, those of skill in the art. Such techniques are 
reviewed at length, for example, in Sambrook, J., et al., Molecular Cloning, a 
Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press, pp. 16.30-16.55 (1989), Watson, JD., et al, Recombinant 
DNA, 2nd Ed., New York: W.H. Freeman and Co., pp. 213-234 (1992), and 
Winnacker, E.-L., From Genes to Clones, New York: VCH Publishers (1987), 
, • which are illustrative of the many laboratory manuals that detail these 
techniques and which are incorporated by reference herein in their entireties 
for their relevant disclosures. 

[0196] The present invention also encompasses producing a subsequent 

nucleic acid and/or a protein by introduction of a cloned product nucleic acid 
molecule of the invention and expression in a host cell. Methods and 
conditions by which to produce such product nucleic acid molecules and 
product proteins are well known in the art. See for example, Sambrook, J., et 
al. 9 Molecular Cloning, a Laboratory Manual, 2nd Ed., Cold Spring Harbor, 
NY: Cold Spring Harbor Laboratory Press (1989). 

[0197] The present invention also encompasses the nucleic acid molecules and 

proteins produced from a host cell of the invention. An improvement of the 
present invention is that nucleic acid molecules produced using methods of the 
present invention, in many instances, will not contain extraneous nucleotides 
that are not associated with the desired nucleic acid, for example nucleotides 
encoded by the restriction sites (e.g. type lis restriction enzyme recognition 
sites). In other words, the seamless cloning methods of the present invention 
allow for a product molecule that does not contain extraneous nucleotides 
from other sources, including the restriction sites. Similarly, the product 
protein molecules produced using the methods of the present invention are 
free of amino acids that are not associated with the desired native or mature 
product protein, for example the product protein molecules are free of amino 
acids encoded by the restriction sites (e.g. type lis restriction sites). The 
proteins produced by the methods of the invention may be of any size, 
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including for example, a short peptide from about 5 amino acids, about 10 
amino acids, about 20 amino acids, about 30 amino acids, about 40 amino 
acids, about 50 amino acids. The present invention also encompasses the 
production of larger proteins, for example about 300 amino acids in length, or 
even a large protein of greater than about 600 amino acids in length. 
[0198] In one embodiment of the present invention, the nucleic acid molecules 

produced from the host cells may be useful as interfering RNA molecules. In 
biological systems that are not amenable to gene targeting or homologous 
recombination, a process called RNA interference (RNAi) is one practical 
method of generating knockout (KO) phenotypes. Post transcriptional gene 
silencing (PTGS) in plants and quelling in Neurospora was described in the 
early 1990s. RNAi was originally described in the model organism C. 
elegans as double stranded RNA (dsRNA) that mediated sequence specific 
gene silencing (Fire et al., Nature 391:806-811 (1998)). RNAi has also been 
described in yeast, Drosophila, plants and trypanosomes. RNAi can be used 
for genetic analysis. For example, it can be used for genome wide RNAi 
screens. RNAi has been shown to be conserved in mammals. RNAi has been 
used in the identification of a short interfering RNA (siRNA) as an effector 
molecule and with microRNA (miRNA) regulation. Essentially, the process 
involves application of double stranded RNA (dsRNA) that represents a 
complementary sense and anti-sense strand of a portion of a target gene within 
the region that encodes mRNA. The presence of the interfering dsRNA causes 
a severe post-transcriptional down-regulation of the target gene. This versatile 
technique has been used as a tool in the study of eukaryotic biology {see 
Sharp, P.A., Genes Dev. 73:139-141 (1999)). RNAi is an evolutionary 
conserved phenomenon and a multi-step process that involves generation of 
active small interfering RNA (siRNA) in vivo through the action of an RNase 
III endonuclease, DICER, which digests long double stranded RNA molecules 
(dsRNA) into shorter fragments (See Figure 13). The 21- to 23-nucleotide 
base pair small interfering RNAs (siRNAs), produced through the action of 
DICER, mediate degradation of the complementary homologous RNA. One 
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bottleneck to using RNAi as a tool has been mRNA target site selection. Yet 
another challenge has been delivery, either transient such as transfection of 
dsRNA (See Figures 16-18)(Kawasaki et al, NAR, 31(3):981-987 (2003)) or 
stable expression using vectors or a virus (See Figures 15 and 19)(Dykxhoorn, 
Novina and Sharp, Nature Reviews, VoL4, (June 2003)). RNAi has 
successfully been reported in stable cell lines and transgenic mice. GFP 
shRNA block GFP expression in transgenic mice, decrease GFP in blastocytes 
and lower GFP fluorescence overall in a three day pup with two copies of the 
shRNA (Tiscornia et. al, PNAS, 2003). 

[0199] RNAi is also powerful in reverse genetics. RNAi can be used as a loss 

of function tool, similar to antisense and ribozymes, but more potent. Natural 
cellular machinery use double stranded RNA to regulate cellular processes 
(e.g., miRNA). Some advantages of RNAi are that it is broadly conserved in 
eukaryotic organisms, is post transciptional (effective in diploids) and is 
tunable (can adj ust level of RNAi at several levels). 

[0200] Until recently, RNAi technology did not appear to be applicable to 

mammalian systems. In mammals, dsRNA activates dsRNA-activated protein 
kinase (PKR) resulting in an apoptotic cascade and cell death (Der et al, Proc. 
Natl. Acad. Set USA P4:3279-3283 (1997)). In addition, it has long been 
known that dsRNA activates the interferon cascade in mammalian cells, which 
can also lead to altered cell physiology (Colby et al, Annu. Rev. Microbiol. 
25:333 (1971); Kleinschmidt et al, Annu. Rev. Biochem. 41:511 (1972); 
Lampson et al., Proc. Natl. Acad. Sci. USA 55L782 (1967); Lomniczi et al., J. 
Gen. Virol. 8:55 (1970); Younger et al, J. Bacteriol. P2:862 (1966)). 
However, dsRNA-mediated activation of the PKR and interferon cascades 
typically require dsRNA longer than about 30 base pairs. Since the primary 
products of DICER are 21-23 base pair fragments of dsRNA, one can 
circumvent the adverse or undesired mammalian responses to dsRNA and still 
elicit an interfering RNA effect via siRNA (Elbashir et al, Nature 411:494- 
498 (2001)). 
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[0201] Thus, another aspect of the present invention provides methods of 

producing an RNA molecule for use as an interfering RNA comprising: (a) 
optionally, identifying one or more target nucleic acid sequences; (b) 
preparing one or more nucleic acid molecules which encode one or more 
interfering RNAs, wherein the interfering RNAs bind to the one or more target 
nucleic acid sequences; (c) combining in vitro or in vivo, (i) the one or more 
first nucleic acid molecules encoding one or more interfering RNAs that have 
one or more sticky ends that have been generated by one or more restriction 
enzymes (e.g. type lis restriction enzymes); and (ii) one or more second 
nucleic acid molecules comprising one or more ends which are compatible 
with the one or more sticky ends on the first nucleic acid molecule(s), and 
optionally comprising one or more selectable markers; and (d) incubating the 
combination under conditions sufficient to join one or more of the nucleic acid 
molecules encoding the interfering RNAs and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules; (e) inserting the one or more product nucleic acid molecules 
into a host cell; and (f) expressing the one or more interfering RNAs in the 
host cell. 

(02021 The present invention also provides methods of producing an RNA 

molecule for use as an interfering RNA comprising: (a) optionally, identifying 
one or more target nucleic acid sequences; (b) preparing one or more nucleic 
acid molecules which encode one or more interfering RNAs, wherein the 
interfering RNAs bind to the one or more target nucleic acid sequences; (c) 
combining in vitro or in vivo, (i) the one or more first nucleic acid molecules 
encoding one or more interfering RNAs flanked by one or more first 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
(ii) one or more second nucleic acid molecules comprising one or more second 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites) 
and optionally comprising one or more selectable markers; and (iii) one or 
more site-specific restriction enzymes (e.g. one or more type lis restriction 
enzymes); and (d) incubating the combination under conditions sufficient to 
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join one or more of the nucleic acid molecules encoding the interfering RNAs 
and one or more of the second nucleic acid molecules, thereby producing one 
or more desired product nucleic acid molecules; (e) inserting the one or more 
product nucleic acid molecules into a host cell; and (f) expressing the one or 
more interfering RNAs in the host cell. 

[0203] In yet another embodiment, the present invention provides methods of 

producing an RNA molecule for use as an interfering RNA comprising: (a) 
optionally, identifying one or more target nucleic acid sequences; (b) 
preparing one or more nucleic acid molecules which encode one or more 
interfering RNAs, wherein the interfering RNAs bind to the one or more target 
nucleic acid sequences; (c) combining in vitro or in vivo, (i) the one or more 
first nucleic acid molecules encoding one or more interfering RNAs that have 
one or more sticky ends that have been generated by one or more restriction 
enzymes (e.g. type lis restriction enzymes); and (ii) one or more second 
nucleic acid molecules comprising one or more ends which are compatible 
with the one or more sticky ends on the first nucleic acid molecule(s), and 
optionally comprising one or more selectable markers; and (d) incubating the 
combination under conditions sufficient to join one or more of the nucleic acid 
molecules encoding the interfering RNAs and one or more of the second 
nucleic acid molecules, thereby producing one or more desired product nucleic 
acid molecules; and (e) expressing one or more interfering RNAs in vitro or in 
vivo. In a first further embodiment, the one or more interfering RNAs may be 
produced in vitro or isolated from a cell and then introduced into a second cell. 

[0204] Another aspect of the present invention provides methods of producing 

an RNA molecule for use as an interfering RNA comprising: (a) optionally, 
identifying one or more target nucleic acid sequences; (b) preparing one or 
more nucleic acid molecules which encode one or more interfering RNAs, 
wherein the interfering RNAs bind to the one or more target nucleic acid 
sequences; (c) combining in vitro or in vivo, (i) the one or more first nucleic 
acid molecules encoding one or more interfering RNAs flanked by one or 
more first restriction sites (e.g. one or more type lis restriction enzyme 
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recognition sites); (ii) one or more second nucleic acid molecules comprising 
one or more second restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites) and optionally comprising one or more selectable 
markers; and (iii) one or more site-specific restriction enzymes (e.g. one or 
more type lis restriction enzymes); and (d) incubating the combination under 
conditions sufficient to join one or more of the nucleic acid molecules 
encoding the interfering RNAs and one or more of the second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules; and (e) expressing one or more interfering RNAs in vitro or in 
vivo. In a first further embodiment, the one or more interfering RNAs may be 
produced in vitro or isolated from a cell and then introduced into a second cell. 

[0205] Another aspect of the present invention provides methods of producing 

an RNA molecule for use as an interfering RNA comprising: (a) optionally, 
identifying one or more target nucleic acid sequences; (b) preparing one or 
more interfering RNAs, wherein the interfering RNAs bind to the one or more 
target nucleic acid sequences; (c) combining in vitro or in vivo, (i) the one or 
more first nucleic acid molecules comprising one or more interfering RNAs 
that have one or more sticky ends that have been generated by one or more 
restriction enzymes (e.g. type lis restriction enzymes); and (ii) one or more 
second nucleic acid molecules comprising one or more ends which are 
compatible with the one or more sticky ends on the first nucleic acid 
molecule(s), and optionally comprising one or more selectable markers; and 
(d) incubating the combination under conditions sufficient to join one or more 
interfering RNAs and one or more of the second nucleic acid molecules, 
thereby producing one or more desired product nucleic acid molecules; (e) 
inserting the one or more product nucleic acid molecules into a host cell; and 
(f) expressing the one or more interfering RNAs in the host cell 

[0206] The present invention also provides methods of producing an RNA 

molecule for use as an interfering RNA comprising: (a) optionally, identifying 
one or more target nucleic acid sequences; (b) preparing one or more nucleic 
acid molecules which comprise one or more interfering RNAs, wherein the 
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interfering RNAs bind to the one or more target nucleic acid sequences; (c) 
combining in vitro or in vivo, (i) the one or more first nucleic acid molecules 
comprising one or more interfering RNAs flanked by one or more first 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
(ii) one or more second nucleic acid molecules comprising one or more second 
restriction sites (e.g. one or more type Us restriction enzyme recognition sites) 
and optionally comprising one or more selectable markers; and (iii) one or 
more site-specific restriction enzymes (e.g. one or more type lis restriction 
enzymes); and (d) incubating the combination under conditions sufficient to 
join one or more interfering RNAs and one or more of the second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules; (e) inserting the one or more product nucleic acid molecules into a 
host cell; and (f) expressing the one or more interfering RNAs in the host cell. 
[0207] Suitable nucleic acid molecules that can function as interfering RNA 

(iRNA) and that can be produced using the methods of the present invention 
may be either single- or double- stranded RNA (ssRNA or dsRNA, 
respectively). Examples of iRNA produced via methods of the present 
invention include, but are not limited to, antisense oligonucleotides, 
ribozymes, small interfering RNAs, double stranded RNAs, inverted repeats, 
short hairpin RNAs, small temporally regulated RNAs and the like. 

Antisense Oligonucleotides 

[0208] In general, antisense oligonucleotides comprise one or more nucleotide 

sequences sufficient in identity, number and size to effect specific 
hybridization with a preselected nucleic. Antisense oligonucleotides produced 
in accordance with the present invention typically have sequences that are 
selected to be sufficiently complementary to the target nucleic sequences 
(suitably mRNA in a target cell or organism) so that the antisense 
oligonucleotide forms a stable hybrid with the mRNA and inhibits the 
translation of the mRNA sequence, preferably under physiological conditions. 
It is preferred but not necessary that the antisense oligonucleotide be 100% 
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complementary to a portion of the target gene sequence. However, the present 
invention also encompasses the production of antisense oligonucleotides with 
a different level of complementarity to the target gene sequence, e.g., 
antisense oligonucleotides that are at least about 50% complementary, at least 
about 55% complementary, at least about 60% complementary, at least about 
65% complementary, at least about 70% complementary, at least about 75% 
complementary, at least about 80% complementary, at least about 85% 
complementary, at least about 90% complementary, at least about 91% 
complementary, at least about 92% complementary, at least about 93% 
complementary, at least about 94% complementary, at least about 95% 
complementary, at least about 96% complementary, at least about 97% 
complementary, at least about 98% complementary, or at least about 99% 
complementary, to the target gene sequence. 
[0209] Antisense oligonucleotides that may be produced in accordance with 

the present invention are well known in the art and that will be familiar to the 
ordinarily skilled artisan. Representative teachings regarding the synthesis, 
design, selection and use of antisense oligonucleotides include without 
limitation U.S. Patent No. 5,789,573, U.S. Patent No. 6,197,584, and 
Ellington, "Current Protocols in Molecular Biology," 2nd Ed., Ausubel et al 9 
eds., Wiley Interscience, New York (1992), the disclosures of which are 
incorporated by reference herein in their entireties. 

Ribozymes 

[0210] In general, ribozymes are RNA molecules having enzymatic activities 

usually associated with cleavage, splicing or ligation of nucleic acid sequences 
to which the ribozyme binds. Typical substrates for ribozymes include RNA 
molecules, although ribozymes may also catalyze reactions in which DNA 
molecules serve as substrates. Two distinct regions can be identified in a 
ribozyme: the binding region which gives the ribozyme its specificity through 
hybridization to a specific nucleic acid sequence, and a catalytic region which 
gives the ribozyme the activity of cleavage, ligation or splicing. Ribozymes 
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which are active intracellularly work in cis, catalyzing only a single turnover, 
and are usually self-modified during the reaction. However, ribozymes can be 
engineered to act in trans, in a truly catalytic manner, with a turnover greater 
than one and without being self-modified. Owing to the catalytic nature of the 
ribozyme, a single ribozyme molecule cleaves many molecules of target 
nucleic acids and therefore therapeutic activity is achieved in relatively lower 
concentrations than those required in an antisense treatment (WO 96/23569). 
[0211] Ribozymes that may be produced in accordance with the present 

invention are well known in the art and that will be familiar to the ordinarily 
skilled artisan. Representative teachings regarding the synthesis, design, 
selection and use of ribozymes include without limitation U.S. Patent No. 
4,987,071, and U.S. Patent No. 5,877,021, the disclosures of all of which are 
incorporated herein by reference in their entireties. 

Small Interfering RNAs (siRNA) 

[0212] RNAi is mediated by double stranded RNA (dsRNA) molecules that 

have sequence-specific homology to their "target" nucleic acid sequences 
(Caplen, N.J., et al., Proc. Natl. Acad. Sci. USA 98:9742-9747 (2001)). 
Biochemical studies in Drosophila cell-free lysates indicate that, in certain 
embodiments of the present invention, the mediators of RNA-dependent gene 
silencing are 21-25 nucleotide "small interfering" RNA duplexes (siRNAs). 
Accordingly, siRNA molecules are suitably used in methods of the present 
invention. The siRNAs are derived from the processing of dsRNA by an 
RNase known as Dicer (Bernstein, E., et al, Nature 409:363-366 (2001)). It 
appears that siRNA duplex products are recruited into a multi-protein siRNA 
complex termed RISC (RNA Induced Silencing Complex). Without wishing 
to be bound by any particular theory, a RISC is then believed to be guided to a 
target nucleic acid (suitably mRNA), where the siRNA duplex interacts in a 
sequence-specific way to mediate cleavage in a catalytic fashion (Bernstein, 
E., et aL Nature 409:363-366 (2001); Boutla, A., et aL, Curr. Biol 77:1776- 
1780(2001)). 
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Small interfering RNAs that may be produced in accordance with the 
present invention are well known in the art and that will be familiar to the 
ordinarily skilled artisan. Small interfering RNAs that may be produced via 
the methods of the present invention suitably comprise between about 1 to 
about 50 nucleotides (nt). For example, siRNAs may comprise about 5 to 
about 40 nt, about 5 to about 30 nt, about 10 to about 30 nt, or about 15 to 
about 30 nt. Longer siRNAs (greater than about 30 nucleotides in length) may 
be useful in some non-human animal systems, and may suitably be produced 
by the methods of the present invention. Most reports describe the use of U6 
or HI pol HI promoters to drive production of siRNA (Lee et al. 9 Nat 
Biotechnol 20:500-505 (2002); Paddison et ai, Genes Dev. 75:948-958 
(2002); Brummelkamp et aL, Science 296:550-553 (2002)). Pol HI promoters 
have all the elements required for initiation of transcription upstream of a 
defined transcription start site and terminate transcription at 4 or more Ts 
(incorporating only 1 or 2 Us into the 3' end of the nascent RNA). These 
attributes allow the production of short RNA molecules with defined ends. 

Inverted Repeats 

Inverted repeats comprise single stranded nucleic acid molecules that 
contain two sequences complementary to each other, oriented such that one of 
the sequences is inverted relative to the other. This orientation allows the two 
complementary sequences to base pair with each other, thereby forming a 
hairpin structure. The two copies of the inverted repeat need not be 
contiguous. There may be "n" additional nucleotides between the hairpin 
forming sequences, wherein "n" is any number of nucleotides. For example, n 
can be about 1, about 5, about 10, about 50, or about 100 nucleotide, or more, 
and can be any number of nucleotides falling within these discrete values. 

Inverted repeats suitable that may be produced in accordance with the 
present invention can be synthesized and used according to procedures that are 
well known in the art and that will be familiar to the ordinarily skilled artisan. 
The production and use of inverted repeats for RNA interference can be found 
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in, without limitation, Kirby, K., et al, Proc. Natl Acad. Sci. USA 99:16162- 
16167 (2002), Adelman, Z. N. f et al, J. Virol 76: 12925-12933 (2002), Yi, C. 
R, et al, J. Biol. Chem. 275:934-939 (2003), Yang, S., et al, Mol Cell Biol 
27:7807-7816 (2001), Svoboda, P., et al, Biochem. Biophys. Res. Commun. 
257:1099-1104 (2001), and Martinek, S. and Young, M. W., Genetics 
255:171-1725 (2000). 

Short Hairpin RNA (shRNA) 

[0216] Paddison, PJ., et al, Genes & Dev. 76:948-958 (2002) have used 

small RNA molecules folded into hairpins as a means to effect RNAi. 
Accordingly, such short hairpin RNA (shRNA) molecules that may be 
produced via the methods of the present invention. Functionally identical to 
the inverted repeats described herein, the length of the stem and loop of 
functional shRNAs distinguishes them from inverted repeats. Stem lengths 
can range from about 1 to about 30 nt, and loop size can range between 1 to 
about 25 nt without affecting silencing activity. While not wishing to be 
bound by any particular theory, it is believed that these shRNAs resemble the 
dsRNA products of the Dicer RNase and, in any event, have the same capacity 
for inhibiting expression of a specific gene. 

[0217] Transcription of shRNAs is initiated at a polymerase HI (pol HI) 

promoter (e.g. U6 and HI promoters) and is believed to be terminated at 
position 2 of a 4-5-thymine transcription termination site. Upon expression, 
shRNAs are thought to fold into a stem-loop structure with 3 f UU-overhangs. 
Subsequently, the ends of these shRNAs are processed, converting the 
shRNAs into -21 nt siRNA-like molecules. 

[0218] Short hairpin RNAs that may be produced in accordance with the 

present invention are well known in the art and that will be familiar to the 
ordinarily skilled artisan. The production and use of inverted repeats for RNA 
interference can be found in, without limitation, Paddison, P J., et al, Genes & 
Dev. 75:948-958 (2002), Yu, J-Y., et al Proc. Natl Acad. Sci. USA 99:6041- 
6052 (2002), and Paul, C. P. etal Nature Biotechnol 20:505-508 (2002). 
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MicroRNAs (mIRNAs) 

The invention may further be used to produce microRNA molecules. 
MicroRNA molecules are molecules which are structurally similar to shRNA 
molecules but, typically, contain one or more mismatches or 
insertion/deletions in their regions of sequence complementary. Hundreds of 
miRNAs have been identified in C. elegans, flies and humans. C. elegans 
miRNA, lin-4 and let-7, have been identified to regulate developmental timing 
and inhibit expression of targeted genes. Examples of miRNA regulation from 
yeast to humans includes regulation of chromatin structure in yeast and tumor 
suppressor genes in humans. At least some microRNA molecules are 
transcribed as polycistrons of about 400, which are then processed to RNA 
molecules of about 70 nucleotides. These double stranded 70 mers are then 
processed again, presumably by the enzyme Dicer, to two RNA molecules 
which are about 22 nucleotides in length and often have one or more (e.g., 
one, two, three, four, five, etc.) internal mismatches in their regions of 
sequence complementarity. (See Figure 25) (Lee et al., EMBO 21:4663-4670 
(2002). The miRNA can enter a miRNA ribonucleoprotein particle (miRNP) 
similar to siRNA entering into the RISC protein complex (Figure 14) 
(Dykxhoorn, Novina and Sharp, Nature Reviews, VoL4, (June 2003)). The 
binding of miRNA/siRNAs of perfect complementarity to a target results in 
mRNA degradation; single base mismatches can block translation. The 
invention also includes, for example, uses of microRNA molecules and 
nucleic acid molecules which encode microRNA molecules which are similar 
to the uses described herein for shRNA and non-hairpin double stranded RNA 
molecules. 

Small Temporally Regulated RNAs (stRNAs) 

Another group of small RNAs that may be produced via the methods 
of the present invention are the small temporally regulated RNAs (stRNAs). 
In general, stRNAs comprise from about 20 to about 30 nt (Banerjee and 
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Slack, Bioessays 24:119-129 (2002)), although stRNAs of any size are also 
suitable for use in accordance with the invention. Unlike siRNAs, stRNAs 
downregulate expression of a target mRNA after the initiation of translation 
without degrading the mRNA. 
[0221] The nucleic acids used in accordance with the present invention can be 

conveniently and routinely made through the well-known technique of solid- 
phase synthesis. Equipment for such synthesis is sold by several vendors 
including, for example, Applied Biosystems (Foster City, Calif.). Other 
methods for such synthesis that are known in the art may additionally or 
alternatively be employed. It is well-known to use similar techniques to 
prepare oligonucleotides such as the phosphorothioates and alkylated 
derivatives. By way of non-limiting example, see, e.g., U.S. Patent No. 
4,517,338, and 4,458,066; Lyer RP, et al 9 Curr. Opin. Mol Ttier. 7:344-358 
(1999); and Verma S, and Eckstein F., Annual Rev. Biochem. 57:99-134 
(1998), the disclosures of all of which are incorporated herein by reference in 
their entireties. 

[0222] The present invention also provides methods for the production of gene 

knockout/knockdown cells and cells lines, as well as genetically modified 
transgenic animals. 

[0223] In such suitable embodiments, the present invention provides methods 

of regulating the expression of one or more genes in a cell or an animal using 
interfering RNA, comprising: (a) identifying one or more target nucleic acid 
sequences; (b) preparing one or more nucleic acid molecules which encode 
one or more interfering RNAs, wherein the interfering RNAs bind to the one 
or more target nucleic acid sequences; (c) combining in vitro or in vivo, (i) the 
one or more first nucleic acid molecules encoding one or more interfering 
RNAs that have one or more sticky ends that have been generated by one or 
more restriction enzymes (e.g. type Us restriction enzymes); and (ii) one or 
more second nucleic acid molecules comprising one or more ends which are 
compatible with the one or more sticky ends on the first nucleic acid 
molecule(s), and optionally comprising one or more selectable markers; (d) 
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incubating the combination under conditions sufficient to join one or more of 
the nucleic acid molecules encoding the interfering RNAs and one or more of 
the second nucleic acid molecules, thereby producing one or more desired 
product nucleic acid molecules; and (e) inserting the one or more interfering 
RNA expression vectors into the cell or one or more cells of the animal, under 
conditions such that the one or more interfering RNAs bind to the one or more 
target nucleic acid sequences, thereby regulating expression of the one or more 
targeted genes. 

[02241 The related embodiments, the present invention also provides methods 

of regulating the expression of one or more genes in a cell or an animal using 
interfering RNA, comprising: (a) identifying one or more target nucleic acid 
sequences; (b) preparing one or more nucleic acid molecules which comprise 
one or more interfering RNAs, wherein the interfering RNAs bind to the one 
or more target nucleic acid sequences; (c) combining in vitro or in vivo, (i) the 
one or more first nucleic acid molecules comprising one or more interfering 
RNAs flanked by one or more first restriction sites (e.g. one or more type lis 
restriction enzyme recognition sites); (ii) one or more second nucleic acid 
molecules comprising one or more second restriction sites (e.g. one or more 
type Us restriction enzyme recognition sites) and optionally comprising one or 
more selectable markers; and (iii) one or more site-specific restriction 
enzymes (e.g. one or more type lis restriction enzymes); (d) incubating the 
combination under conditions sufficient to join one or more interfering RNAs 
and one or more of the second nucleic acid molecules, thereby producing one 
or more desired product nucleic acid molecules; and (e) inserting the one or 
more interfering RNA expression vectors into the cell or one or more cells of 
the animal, under conditions such that the one or more interfering RNAs bind 
to the one or more target nucleic acid sequences, thereby regulating expression 
of the one or more targeted genes. 

[0225] The nucleic acid molecules of the invention can also be used to 

produce transgenic organisms (e.g., animals and plants). Animals of any 
species, including, but not limited to, mice, rats, rabbits, hamsters, guinea pigs, 
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pigs, micro-pigs, goats, sheep, cows and non-human primates (e.g., baboons, 
monkeys, and chimpanzees) may be used to generate transgenic animals. 
Further, plants of any species, including but not limited to Lepidium sativum, 
Brassica juncea, Brassica oleracea, Brassica rapa, Acena sativa, Triticum 
aestivum, Helianthus annuus, Colonial bentgrass, Kentucky bluegrass, 
perennial ryegrass, creeping bentgrass, Bermudagrass, Buffalograss, 
centipedegrass, switch grass, Japanese lawngrass, coastal panicgrass, spinach, 
sorghum, tobacco and corn, may be used to generate transgenic plants. 

Any technique known in the art may be used to introduce nucleic acid 
molecules of the invention into organisms to produce the founder lines of 
transgenic organisms. Such techniques include, but are not limited to, 
pronuclear microinjection (Paterson et al, Appl. Microbiol. Biotechnol. 
40:691-698 (1994); Carver et al, Biotechnology (NY) 77:1263-1270 (1993); 
Wright et al, Biotechnology (NY) 9:830-834 (1991); and Hoppe et al, U.S. 
Pat No. 4,873,191 (1989)); retrovirus mediated gene transfer into germ lines 
(Van der Putten et al, Proc. Natl Acad. ScL, USA 52:6148-6152 (1985)), 
blastocysts or embryos; gene targeting in embryonic stem cells (Thompson et 
al, Cell 55:313-321 (1989)); electroporation of cells or embryos (Lo, Mol 
Cell. Biol 3:1803-1814 (1983)); introduction of the polynucleotides of the 
invention using a gene gun (see, e.g., Ulmer et al, Science 259:1145 (1993); 
introducing nucleic acid constructs into embryonic pluripotent stem cells and 
transferring the stem cells back into the blastocyst; and sperm-mediated gene 
transfer (Lavitrano et al, Cell 57:717-723 (1989); etc. For a review of such 
techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol 775:171-229 
(1989), which is incorporated by reference herein in its entirety. Further, the 
contents of each of the documents recited in this paragraph is herein 
incorporated by reference in its entirety. See also, U.S. Patent No. 5,464,764 
(Capecchi et al, Positive-Negative Selection Methods and Vectors); U.S. 
Patent No. 5,631,153 (Capecchi et al, Cells and Non-Human Organisms 
Containing Predetermined Genomic Modifications and Positive-Negative 
Selection Methods and Vectors for Making Same); U.S. Patent No. 4,736,866 
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(Leder et aL 9 Transgenic Non-Human Animals); and U.S. Patent No. 
4,873,191 (Wagner et aL, Genetic Transformation of Zygotes); each of which 
is hereby incorporated by reference in its entirety. 

Any technique known in the art may be used to produce transgenic 
clones containing nucleic acid molecules of the invention, for example, 
nuclear transfer into enucleated oocytes of nuclei from cultured embryonic, 
fetal, or adult cells induced to quiescence (Campell et aL, Nature 380:64-66 
(1996); Wilmut et aL, Nature 355:810-813 (1997)), each of which is herein 
incorporated by reference in its entirety). 

The present invention provides for transgenic orgapsms that carry 
nucleic acid molecules of the invention in all their cells, as well as organisms 
which carry these nucleic acid molecules, but not all their cells, te., mosaic 
organisms or chimeric. The nucleic acid molecules of the invention may be 
integrated as a single copy or as multiple copies such as in concatamers, e.g., 
head-to-head tandems or head-to-tail tandems. The nucleic acid molecules of 
the invention may also be selectively introduced into and activated in a 
particular cell type by following, for example, the teaching of Lasko et aL 
(Lasko et aL, Proa NatL Acad. Sci. USA 89:6232-6236 (1992)). The 
regulatory sequences required for such a cell-type specific activation will 
depend upon the particular cell type of interest, and will be apparent to those 
of skill in the art. When it is desired that nucleic acid molecules of the 
invention be integrated into the chromosomal site of the endogenous gene, this 
will normally be done by gene targeting. Briefly, when such a technique is to 
be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function 
of the nucleotide sequence of the endogenous gene. Nucleic acid molecules of 
the invention may also be selectively introduced into a particular cell type, 
thus inactivating the endogenous gene in only that cell type, by following, for 
example, the teaching of Gu et aL (Gu et aL, Science 255:103-106 (1994)). 
The regulatory sequences required for such a cell-type specific inactivation 
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will depend upon the particular cell type of interest, and will be apparent to 
those of skill in the art. The contents of each of the documents recited in this 
paragraph is herein incorporated by reference in its entirety. 

[0229] Once transgenic organisms have been generated, the expression of the 

recombinant gene may be assayed utilizing standard techniques. Initial 
screening may be accomplished by Southern blot analysis or PCR techniques 
to analyze organism tissues to verify that integration of nucleic acid molecules 
of the invention has taken place. The level of mRNA expression of nucleic 
acid molecules of the invention in the tissues of the transgenic organisms may 
also be assessed using techniques which include, but are not limited to, 
Northern blot analysis of tissue samples obtained from the organism, in situ 
hybridization analysis, and reverse transcriptase-PCR (rt-PCR). Samples of 
tissue may which express nucleic acid molecules of the invention also be 
evaluated immunocytochemically or immunohistochemically using antibodies 
specific for the expression product of these nucleic acid molecules. 

[0230] Once the founder organisms are produced, they may be bred, inbred, 

outbred, or crossbred to produce colonies of the particular organism. 
Examples of such breeding strategies include, but are not limited to: 
outbreeding of founder organisms with more than one integration site in order 
to establish separate lines; inbreeding of separate lines in order to produce 
compound transgenic organisms that express nucleic acid molecules of the 
invention at higher levels because of the effects of additive expression of each 
copy of nucleic acid molecules of the invention; crossing of heterozygous 
transgenic organisms to produce organisms homozygous for a given 
integration site in order to both augment expression and eliminate the need for 
screening of organisms by DNA analysis; crossing of separate homozygous 
lines to produce compound heterozygous or homozygous lines; and breeding 
to place the nucleic acid molecules of the invention on a distinct background 
that is appropriate for an experimental model of interest. 

[0231] Transgenic and "knock-out" organisms of the invention have uses 

which include, but are not limited to, model systems (e.g., animal model 
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systems) useful in elaborating the biological function of expression products 
of nucleic acid molecules of the invention, studying conditions and/or 
disorders associated with aberrant expression of expression products of 
nucleic acid molecules of the invention, and in screening for compounds 
effective in ameliorating such conditions and/or disorders. 
[0232] As one skilled in the art would recognize, in many instances when 

nucleic acid molecules of the invention are introduced into metazoan 
organisms, it will be desirable to operably link sequences which encode 
expression products to tissue-specific transcriptional regulatory sequences 
(e.g., tissue-specific promoters) where production of the expression product is 
desired. Such promoters can be used to facilitate production of these 
expression products in desired tissues, A considerable number of tissue- 
specific promoters are known in the art. Further, methods for identifying 
tissue-specific transcriptional regulatory sequences are described elsewhere 
, herein. 

[0233] The present invention also provides isolated nucleic acids comprising: 

(a) one or more sticky ends that have been generated by one or more 
restriction enzymes (e.g. one or more type lis restriction enzymes); and (b) 
optionally one or more selectable markers. The present invention furtther 
provides isolated nucleic acids comprising: (a) one or more restriction sites 
(e.g. one or more type lis restriction enzyme recognition sites); and (b) 
optionally one or more selectable markers. As noted above, selectable 
markers for use in the isolated nucleic acids of the present invention comprise 
antibiotic resistance genes and toxic genes. As also described above, the 
isolated nucleic acids molecules of the present invention may also comprise 
one or more recombination sites, and one or more topoisomerase recognition 
sites and/or one or more topoisomerases. In suitable embodiments, the 
topoisomerase recognition site, if present, may optionally be flanked by two or 
more recombination sites. 
- [0234] In another embodiment, the present invention provides isolated nucleic 

acids comprising: (a) one or more sticky ends that have been generated by one 
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or more restriction enzymes (e.g. one or more type lis restriction enzymes); 
and (b) one or more recombination sites. In yet another embodiment, the 
present invention provides isolated nucleic acids comprising: (a) one or more 
restriction sites (e.g. one or more type lis restriction enzyme recognition sites); 
and (b) one or more recombination sites. Suitable recombination sites include, 
but are not limited to, attB sites, atiP sites, attL sites, attR sites, lox sites, psi 
sites, Uipl sites, dif sites, cer sites, fit sites, and mutants, variants and 
derivatives thereof. In suitable embodiments, the isolated nucleic acid 
molecules of the present invention may optionally comprise one or more 
selectable markers, one or more topoisomerase recognition sites and/or one or 
more topoisomerases. In suitable embodiments, the topoisomerase recognition 
site, if present, may flanked by two or more recombination sites. In additional 
embodiments, the one or more recombination sites may flank one of more 
restriction sites (e.g. one or more type lis sites) and/or the one or more 
selectable markers, if present. 
[0235] The present invention also provides vectors comprising: (a) one or 

more desired nucleic acid segments; (b) optionally one or more toxic genes; 
and (c) one or more restriction sites (e.g. one or more type lis restriction 
enzyme recognition sites). Desired nucleic acid segments include, but are not 
limited to one or more genes, and one or more promoters. Suitable restriction 
sites include type Us restriction enzyme recognition sites, such as those sites 
described above. The vectors of the present invention may also comprise one 
or more recombination proteins, and one or more topoisomerase recognition 
sites and/or one or more topoisomerases. In suitable embodiments, the 
topoisomerase recognition site, if present, may flanked by two or more 
recombination sites. The vectors of the present invention optionally comprise 
suitable toxic genes, as described above. The vectors of the present invention 
may also optionally include one or more selectable marker as described 
throughout the specification. In another suitable embodiment, the vectors of 
the present invention may be "precut" by a restriction enzyme (e.g. a type lis 
restriction enzyme). This precut vector may then be used to clone one more 
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second nucleic acid molecules which may comprise sticky ends, compatible 
with the vector, or optionally, may comprise on or more restriction sites (e.g. 
one or more type lis restriction enzyme recognition sites). 

The present invention also provides methods of expressing and 
isolating nucleic acid molecules and proteins comprising: (a) obtaining one or 
more isolated nucleic acid molecules of the present invention; (b) introducing 
the isolated nucleic acid molecule into a host cell; (c) incubating the host cell 
under conditions sufficient to allow expression of a nucleic acid molecule or a 
protein encoded by the isolated nucleic acid molecule; and (d) isolating the 
expressed nucleic acid molecule or expressed protein. Host cells suitable for 
use in accordance with this aspect of the invention are described elsewhere 
herein. Suitable incubation conditions are well known in the art and are 
described in Freshney, R. I,. "Culture of Animal Cells: A Manual of Basic 
Technique," Alan R. Liss, Inc, New York (1983) and Maniatis et al., 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1982) and comprise incubating a host cell in 
a suitable growth medium with sufficient nutrients (e.g. Eagle f s Minimum 
Essential Medium, DMEM: F12 Medium, RPMI-1640 Medium, Dulbecco's 
Modified Eagle's Medium, and the like) at an appropriate temperature (about 
37°C). Methods of isolation of nucleic acid molecules and expressed proteins 
from host cells; are also well known in the art and described in Manitais id. and 
similar texts. 

| The expressed nucleic acid molecules may be suitable for use as 

interfering RNA as described above. As described throughout the 
specification, the expressed nucleic acid molecules will often not comprise 
extraneous, undesired nucleic acids, for example nucleic acids encoded by the 
one or more restriction sites (e.g. one or more type lis recognition sites). 
Similarly, the proteins produced via the methods of the present invention may 
not comprise extraneous, undesired amino acids, for example amino acids 
encoded by the one or more restriction sites (e.g. one or more type lis 
recognition sites). 
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[0238] « The present invention also provides for methods of expressing desired 
nucleic acid segments comprising: obtaining a product nucleic acid molecule 
of the invention and incubating the nucleic acid molecule under conditions (in 
vitro or in vivo) such that the' desired product nucleic acid molecule is 
transcribed and then translated. Incubation conditions for these methods of the 
invention are well known in the art as noted above. 

[0239] The present invention also provides for methods of expressing desired 

nucleic acid segments comprising: (a) obtaining a vector of the present 
invention; (b) introducing the vector into a host cell; and (c) incubating the 
host cell under conditions sufficient to allow expression of a desired nucleic 
acid segment encoded by the vector. Incubation conditions for these methods 
of the invention are well known in the art as noted above. 

[0240] Another embodiment of the present invention provides compositions 

comprising the elements described above that are involved in the various 
cloning methods of the invention. Such compositions comprise: (a) one or 
more first nucleic acid molecules comprising one or more sticky ends that 
have been generated by a restriction enzyme (e.g. one or more type lis 
restriction enzymes); (b) one or more second nucleic acid molecules 
comprising one or more sticky ends which are compatible with the one more 
sticky ends one the first nucleic acid molecule and, optionally, one or more 
selectable markers. Suitable restriction enzymes include those described 
throughout the specification, including, type Us restriction enzyme recognition 
sites. The nucleic acids comprised in any of the compositions of the present 
invention may optionally further comprise one or more selectable markers, 
one or more recombination sites, one or more topoisomerase recognition sites 
and/or one or more topoisomerases and described above. The compositions 
may comprise one or more recombination proteins. Suitable recombination 
proteins include, but are not limited to, those described throughout the 
specification. 

[0241] Another embodiment of the present invention provides compositions 

comprising the elements described above that are involved in the various 
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cloning methods of the invention. Such compositions comprise: (a) one or 
more first nucleic acid molecules comprising at least one nucleic acid segment 
flanked by one or more first restriction sites (e.g. one or more type lis 
restriction enzyme recognition sites); (b) one or more second nucleic acid 
molecules comprising one or more second restriction sites (e.g. one or more 
type lis restriction enzyme recognition sites) and optionally one or more 
selectable markers; and (c) one or more restriction enzymes (e.g. one or more 
type lis restrictipn enzymes) that are specific for said first and/or second 
restriction enzyme recognition sites. Suitable restriction enzymes include 
those described throughout the specification, including, type lis restriction 
enzyme recognition sites. The nucleic acids comprised in any of the 
compositions of the present invention may optionally further comprise one or 
more selectable markers, one or more recombination sites, one or more 
topoisomerase recognition sites and/or one or more topoisomerases and 
described above. The compositions may comprise one or more recombination 
proteins. Suitable recombination proteins include, but are not limited to, those 
described throughout the specification. 

[0242] The present invention also provides kits comprising the isolated 

nucleic acids and/or vectors of the present invention. These kits are useful for 
practicing the various methods of the invention. Kits may comprise one or 
more first nucleic acid molecules and one or more second nucleic acid 
molecules. The first nucleic acid molecule may be an isolated nucleic acid 
molecule of the invention and the second nucleic acid molecule may be a 
vector of the present invention. 

[0243] Kits of the invention may contain any number of components but 

typically will contain at least two components. Kits according to this aspect of 
the invention may comprise one or more containers, which may contain one or 
more components selected from the group consisting of one or more nucleic 
acid molecules or vectors of the invention, one or more primers, one or more 
polymerases, one or more reverse transcriptases, one or more recombination 
proteins, one or more restriction enzymes (e.g. one or more type lis restriction 
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enzymes, or other enzymes for carrying out the methods of the invention), one 
or more topoisomerases, one or more buffers, one or more detergents, one or 
more restriction endonucleases, one or more nucleotides, one or more 
terminating agents (e.g., ddNTPs), one or more transfection reagents, 
. pyrophosphatase, and the like. The kits of the invention may also comprise 
instructions for carrying out methods of the invention. 
[0244] It will be readily apparent to one of ordinary skill in the relevant arts 

that other suitable modifications and adaptations to the methods and 
applications described herein may be made without departing from the scope 
of the invention or any embodiment thereof. Having now described the 
present invention in detail, the same will be more clearly understood by 
reference to the following examples, which are included herewith for purposes 
of illustration only and are not intended to be limiting of the invention. 

Examples 

Example 1 

Expression of Interfering RNA using a Seamless Cloning Vector 

[0245] The expression of short interfering hairpin RNA molecules (shRNA) in 

vivo can decrease the expression of genes with complementary sequences by 
RNA interference (RNAi) as described previously. The seamless cloning 
vector described here (pENTR/U6) allows for rapid and efficient cloning of 
double-stranded oligonucleotide pairs (~47bp) coding fox a desired shRNA 
target sequence into a Pol HI U6 expression cassette. The resulting shRNA 
vector contains an RNAi cassette flanked by aUL sites. Therefore, the 
pENTR/U6 shRNA vectors can be used directly for transient transfection to 
test various shRNA target sequences, as well as to transfer the best shRNA 
cassettes to Lenti and Adenoviral DEST vectors for delivery into "hard to 
transfect" cells. 
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Kit Components 

[0246] Purified, Bsal-linearized pENTR/U6.2 (once it is cut with BsdL, i.e. the 

linear vector is called pENTR/U6) (Catalog No. K4945-00 and K4944-00, 
Invitrogen, Corp., Carlsbad, CA) Annealed lamin A/C control oligos: Top 5'- 
CACCGTGTTCTTCTGGAAGTCCAGCGAACTGGACTTCCAGAAGA 
ACA (SEQ ID NO:9), Bottom 5'- 

AAAATGTTCTTCTGGAAGTCCAGTTCGCTGGACTTCCAGAAGAACA 
C (SEQ ID NO: 10), Sequencing primers: 

U6 forward 5'-GGACTATCATATGCTTACCG (SEQ ID NO.ll), M13 
reverse 5 ' -C AGGAAAC AGCT ATGAC (SEQ ID NO:12)(Catalog No. N530- 
2, Invitrogen, Corp., Carlsbad, CA), T4 DNA ligase (Catalog No. 15224-025, 
Invitrogen, Corp., Carlsbad, CA) 5X T4 DNA ligase buffer (Catalog No. 
Y90001, Invitrogen, Corp., Carlsbad, CA), OneShot ToplO cells (Catalog No. 
C4040-03, Invitrogen, Corp., Carlsbad, CA). Thus, exemplary kits of the 
invention may comprise one, more, or all of these components. 

Vector Construction 

[0247] Entry vector. The nucleic acid sequence of pENTR U6.2 (Bsal-ccdB) 

is shown in Table 5, SEQ. ID. NO:l. The U6 promoter sequence was PCR 
amplified from genomic DNA (primers: 5'-AAGGTCGGG 
CAGGAAGAGGG-3 ' (SEQ ID NO:13); 5'- 

AGCGAGC ACGGTGTTTCGTC-3 ' (SEQ ID NO:14)) and TOPO cloned into 
pCR2.1/TOPO (included in kits, Catalog Nos. K4500-01, K4500-40, K4550- 
01, K4550-40, K4560-01, K4560-40, K4520-01 and K4520-40, Invitrogen, 
Corp., Carlsbad, CA). The promoter sequence was subsequently PCR 
amplified with the same primer sequences but with Asp718 and Not! sites 
appended to the primer 5' ends (5 'GTGGGTACCAAGGTCGGGCAGGAAG 
AGGG-3' (SEQ ID NO:15; 5'- 

GTGGCGGCCGCGGTGTTTCGTCCTTTCCACAAG-3' (SEQ ID NO:16)). 
This PCR product was cloned by Aspl\Z-Notl sticky end ligation into an 
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Entry vector with the pENTR/la polylinker (Catalog No. 11813, Invitrogen, 
Corp., Carlsbad, CA) andpDONR/221 backbone (Catalog No. 12536-017, and 
provided in kits 12537-023, 12538-013, 12535-019, Invitrogen, Corp., 
Carlsbad, CA). The ccdB gene was amplified from pLenti6/V5/DEST 
(Catalog Nos. V496-10 and K4960-00, Invitrogen, Corp., Carlsbad, CA) 
(primers: 5'-GTGGCGGCCGCAAAGATCCTCCAGTGGATCCGGCTTAC 
TAAAAG-3' (SEQ ID NO:17); 

5 'GTGCTCGAGAAAAAAGTCGACACGGAGCCCTCC 
AGTTATATTCCCCAGAACATC AGG-3 ' (SEQ ID NO:18)) and cloned into 
the above vector at the Notl and Xhol sites. These primers introduced Bpml 
restriction enzyme sites in the proper position at the ends of the PCR product 
and a 6bp polyT Pol III terminator. 

[0248] To engineer the Bsal vector, a double stranded oligo containing a Bsal 

site and Notl site (5'GAGACCGCGGCCGCTTCTCGAGGTCTCATT (SEQ 
ID NO:19) + 5'TGAGACCTCGA GAAGCGGCCGCGGTCTCCG-3 ' (SEQ 
ID NO:20)) was cloned into i?/wjl-digested plasmid. The resulting plasmid 
was digested with Notl and Xbal and ligated to a new ccdB region PCR 
amplified (primers: 5 'CACGCGGCCGCTGGATCCGGCTTACTAAAAG-3 ' 
(SEQ ID NO:21); 5 ' C ACTCTAGAA 

AAAATGAGACCTTATATTCCCCAGAACATCAGG-3 ' (SEQ ID NO:22)) 
with a Notl site on one end and a Bsal site, 6bp polyT Pol m terminator, and 
Xbal site at the other. The final construct is named pENTR/U6.2 (Bsal-ccdB). 

[0249] LacZ expression control vector. The LacZ expression control 

plasmid, PCDNA2.2 1 MS/GW/LacZ was made using Multi-site Gateway 
(CMVlacZV5). pENTR5'-CMV, pENTR-IacZ and pENTR/V5TKpolyA 
were mixed with the DEST R4R3 plasmid using LR Plus Clonase. The three 
plasmids in the Multi-site reaction were all created by a standard Gateway 
recombination reaction: 1) the CMV promoter was amplified from pcDNA3.1 
(Catalog No. V790-20 and V795-20, Invitrogen, Corp., Carlsbad, CA) using 
primers flanked with attBA and atfBl sequences and recombined with pDonr 
5'(P4-P1R) to form pENTR5'-CMV. 2) The LacZ gene was amplified from 
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pcDNA3.1-£acZ using attBl and a«B2 flanking primers and recombined 
with pDonr 221 to create pENTR-ZacZ, and, 3) the V5-TKpolyA element was 
amplified from pcDNA3.2 using attB2 and attB3 primers and recombined 
withpDonr3'(P2-P3R). 
[0250] Preparation of linear pENTR/U6.2 ready for cloning. pENTR/U6.2 in 

DB3.1 cells was grown in LB media with 50ng/ml kanamycin. Plasmid DNA 
was purified by SNAP midi prep with a yield of 67ug/50ml of culture. Ten ng 
of vector was digested with Bsal at 50°C in 200ul with 5 units of BsaV]ig of 
DNA for 2 hrs. After addition of 1 .5vol of SNAP miniprep binding buffer, the 
reaction was added to a SNAP miniprep column, washed according to the 
SNAP protocol for miniprep DNA, and eluted in 100^1 ddH20 and stored at - 
20°C. 

[0251] ShRNA Oligonucleotide Annealing. DNA oligonucleotides of 46-53nt 

were produced with desalt purification only. Individual oligos were diluted in 
ddH20 to a final concentration of 200uM as verified by spectrophotometric 
analysis at OD 2 6o- Complementary oligos were mixed to the final desired 
concentration with either: 1) TE (lOmM Tris pH 8.0, ImM EDTA), 2) 10X 
Annealing Buffer and ddH20 such that the final, IX buffer was lOmM Tris pH 
8.0, lOOmM NaCl, ImM EDTA, or 3) the same buffer as in 2 but with a final 
concentration of lOmM MgCl 2 . (For example, to create a 50uM stock of a ds- 
oligo in 20|jl, 5^1 of each 200jaM ss complementary oligo was mixed with 2jll1 
of 10X Annealing buffer and 8pl of ddH20). Mixed oligo pairs were heated 
and cooled in either an MJ thermocycler (94°C for 2 min, then decreased by 
0.1 °C every second to 25°C, and stored at 4°C) or incubation in a 95°C bath 
for 4min, then cooling to room temperature over 15min before putting the 
sample on ice. Annealed ds-oligos were diluted to the desired concentration 
with TE at room temperature. 

[0252] Cloning target site DNA oligos into pENTRAJ6. Bsal cut pENTR/U6.2 

and ds-oligos were incubated in a 20jnl reaction using 5 times ligase buffer and 
IjjlI ligase for 5 min at room temperature. Two microliters of the ligation 



-93- 



reaction were added to chemically competent Top 10 One Shot cells (Catalog 
Nos. C4040-10, C4040-03, C4040-06, Invitrogen, Corp., Carlsbad, CA, 
~50pl), incubated on ice for 20min, heat shocked at 42°C for 30 sec, and 
placed back on ice, followed by the addition of 250(il SOC and incubation at 
37°C (shaking) for 1 hr. Ten to one hundred microliters of this transformation 
reaction were plated on LB Kan (50^g/ml) agarose plates. 

[0253] The number of colonies per plate was determined after an overnight 

incubation at 37°C. A supercoiled pUC19 (2fil of a lOpg/pl stock) 
transformation control was performed with each set of cells transformed; in 
this case the transformation efficiency is reported as number of colony 
forming units per microgram. 

[0254] Sequence analysis ofpENTR/U6 shRNA target clones. Plasmid DNA 

was isolated from pENTR/U6 clones using the SNAP mini prep kit (Catalog 
No. Kl 900-01, Invitrogen, Corp., Carlsbad, CA)under standard conditions. 
Two different primers were used for sequence analysis: 

1) U6 forward, 5 ' -GGACT ATC AT ATGCTT ACCG (forward primer, 
binds in U6 promoter 55bp from the 3' end of the U6 promoter)(SEQ ID 
NO: 11) 

2) M13 R, 5'-CAGGAAACAGCTATGC (reverse primer, binds 
"downstream" from the AttL2 site, 146bp from the pol III termination)(SEQ 
ID NO: 12) 

[0255] Gateway LxR recombination. 1 5 Ong of each pENTR/U6 shRNA clone 

and 150ng of pLenti6/PL-DEST or 300ng of pAD/PL-DEST (Figure 10) 
(Catalog No. V494-20, Invitrogen, Corp., Carlsbad, CA) were incubated in a 
20|al reaction using the 5X buffer and 5X LR Clonase enzyme mix, and 
incubated at 25°C for lhr. Two microliters of this LxR reaction were 
transformed into chemically competent cells as described above except that 
selection plates had 50ug/ml ampicillin instead of kanamycin. 



-94- 



ShRNA transfections 

[0256] All transfections were carried out in 24-well plates. For luciferase and 

p-galactosidase (P-gal) knockdown experiments, 600ng of pENTRAJ6-shRNA 
vectors were cotransfected with lOOng each pcDNA5/FRT/luc and the 
pcDNA1.2/V5-GW/focZ positive control plasmid into GripTite™ 293 cells 
(Catalog No. R795-07, Invitrogen, Corp., Carlsbad, CA) using Lipofectamine 
2000™. Briefly, cells were plated the day before transfection in 0.5ml 
medium lacking antibiotics at 2 x 10 5 cells per well. On the day of 
transfection, cells were typically 90-95% confluent. For each well, 2fil of 
Lipofectamine 2000™ were diluted with 48jxl OptiMEM, incubated 5 min at 
room temperature, then mixed with DNAs diluted with OptiMEM to 50|xl. 
Complexes were incubated an additional 20min at room temperature before 
addition to cells. Medium was changed 3hr after transfection to minimize 
toxicity. 

Luciferase and p-gal assays. 

[0257] After 48hr, GripTite™ 293 cells were lysed in 0.5ml luciferase lysis 

buffer (25mM Tris-HCl pH 8.0, O.lmM EDTA pH 8.0, 10% glycerol, 0.1% 
Triton X-100) and subjected to a ~80°C freeze-thaw. 50p.l of each lysate was 
used in a luciferase luminescence assay (Promega) while another lOul was 
used in a P-gal luminescence assay (Tropix) according to the manufacturers' 
instructions. 

Results 

[0258] The vector pENTR/U6 is designed to express shRNA in mammalian 

cells for use in RNAi. (pENTR/U6.2 is the supercoiled vector containing the 
ccdB gene; once linearized with Bsdi, the vector will be referred to as 
pENTR/U6.) pENTR/U6 allows the cloning of shRNA target sequences 
between the human U6 pol HI promoter and a 6 T termination signal in a 
Gateway Entry (ENTR) vector. In this case, the entire RNAi cassette (U6 
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promoter, cloning site, and temiination signals) is between the attlA and attL2 
recombination sites. Therefore, U6 driven expression of an shRNA is possible 
directly from ENTR vector and does not require subsequent LxR transfer to a 
DEST vector. 

Vector preparation 

[0259] pENTR/U6.2 (Bsal-ccdB) is digested with the type DS restriction 

enzyme Bsal in preparation for cloning ds-oligos (~47mers) containing 
shRNA target sequences. Type lis restriction enzymes cut outside of their 
recognition sequence and can therefore be used to create sticky ends of any 
sequence in the vector. In this case, the Bsal digest leaves the 4nt 5' ssDNA 
end 3'-GTGG-5* at the end of the U6 promoter and the single stranded 3'- 
TTTT-5' at the other vector end (the first four Ts of the termination signal). 

[0260] Digestion of the pENTR/U6.2 by Bsal generates three fragments 

(2850, 577, and 91bp). The linearized cloning vector is 2850bp; smaller 
fragments derive from the ccdB gene (ccdB has a Bsal site). Removal of the 
smaller fragments from the final vector prep is not required; however, the 
amount of the 91bp fragment recovered from the SNAP purification can vary. 
Uncut pENTR/U6.2 or clones that have reassembled the functional ccdB gene 
will not propagate in Top 10 cells. The cloning efficiency of either small 
fragment alone is very low due to non-compatible ends. 

Insert Annealing 

[0261] A five-minute bench top ligation and subsequent transformation is 

highly efficient at cloning dsDNA oligo shRNA target sequences - if the oligo 
inserts are properly annealed. A typical 46nt ss-oligo is made of a 4nt 5' 
cloning overhang followed by 19nt of "sense" and a complementary 19nt 
"antisense" sequence connected by short 4nt "loop." Thus the oligos can form 
a ~19bp DNA intra-molecular hairpin. Therefore, conditions must be 
optimized to favor intermolecular annealing between two different 
complementary oligos rather than the produc]tion of single-strand 
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intramolecular hairpins. The formation of intermolecular ds-oligos can be 
accomplished by melting (healing to 94°C) and cooling complementary oligos 
at high concentrations in the appropriate buffer. 

[0262] Intermolecular double-stranded molecules can be formed in annealing 

buffers containing either 20 or lOOmM NaCl when the oligo concentration is 
SOpM during the heating and cooling cycle. The ds-molecules can be 
separated from the single-stranded hairpins in an E-gel. Additionally, no 
difference was noted between using the Thermocycler or water bath protocols 
to melt/cool the reaction. 

[0263] Upon closer examination of the salt and oligo concentration, a buffer 

without any NaCl (TE) would not support formation of ds-47mers even at 
lOOpM concentrations, adding MgCl 2 to lOOmM NaCl had no effect, and 
oligo concentrations of less the 50\iM were compromised in the amount of ds- 
47mers created. 

[0264] Once created, the dsDNA 47mer shRNA inserts can be diluted in TE 

for cloning. After the ds-47mers are diluted, they are stable at 4°C overnight, 
but will form single strand hairpins if melted, i.e. incubated at temps above 
42°C. 

[0265] Heating and cooling of shRNA target oligos at concentrations of 50pM 

or greater in lOmM Tris pH 8.0, lOOmM NaCl, ImM EDTA creates a mixture 
of ~50:50 ds/hairpin molecules which can be effectively cloned into Bsal 
linearized pENTR/U6 (see pENTR/U6 cloning, below). 

Gateway ENTR vector testing 

[0266] The supercoiled pENTR/U6.2 (Bsal-ccdB) vector, prior to 

linearization for cloning, passes the criteria set for Gateway ENTR vectors (> 
104 killing by ccdB). Supercoiled pENTR/U6.2 was transformed into E. coli 
cells it should kill (ToplO and HB101 cells) as well as the DB3.1 cell line 
designed to propagate plasmids with the ccdB gene. pENTR/U6.2 transforms 
DB3.1 cells 1.3 x 10 4 times better than ToplOs cells once the number of 
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colonies per plate are adjusted for the different transformation efficiencies of 
the different cell lines (the ToplO cells were -200 times more competent than 
the DB3.1 cells and -400 times more competent than the HB101 cells). 
[0267] When Bsal digestion of pENTR/U6.2 is complete, most of the 

supercoiled vector is linearized. Transformation of Bsal cut, SNAP purified 
pENTR/U6 vector only generated a small number of "background" colonies 
per plate in ToplO or DB3.1 cells. Eight colonies were obtained in DB3.1 
cells and all looked like the parent sc pENTR/U6.2 by KFLP analysis (data not 
shown) indicating the Bsal digest is efficient and only a small fraction of the 
plasmids are left uncut after the 2hr incubation. In ToplO cells only 4 colonies 
were obtained; RFLP analysis of these indicated two classes, neither of which 
was the parent plasmid (possibly pENTR/U6 closed without the ccdB gene 
and one fragment of the ccdB gene re-cloned). 

pENTR-U6 cloning 

[0268] A five-minute bench-top ligation is an easy and efficient method to 

clone shRNA target sequences into pENTR/U6. The cloning process was 
optimized over a wide range of vector concentrations (20pg - 5ng) and insert 
concentrations (0.4pg - lOng) with the shRNA target sequence lacZ-19. All 
the optimization of the cloning reaction was done with ds-oligos annealed at a 
concentration of 5G|jM prior to dilution in TE and transformation into 
chemically competent ToplO cells. Sequence analysis of the shRNA clones 
demonstrate that >90% have inserts in the correct orientation. 

[0269] Greater than 15 other ds-oligo inserts, each with a different shRNA 

target sequence, have been cloned into pENTR/U6 under comparable 
conditions. In all cases, the number of colonies generated was similar to the 
numbers of colonies generated with the lacZ-19 ds-oligo. No significant 
difference has been noted in how different inserts clone into the pENTR/U6 
vector. 
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Sequence analysis 

[0270] The efficiency of cloning shRNA target-sequence inserts was 

determined by sequence analysis through shKNA target sequences. Analysis 
of the lacZ-19 shRNA target inserts cloned in pENTR/U6 under the 
recommended conditions, demonstrated that 100% (38/38) of the randomly 
selected clones have an insert cloned in the correct orientation. 

[0271] Sequence analysis with the U6 forward primer provides excellent 

sequence through the cloned shRNA target sequence. It is designed for ease 
of analysis of the cloned oligos, binds the U6 promoter inside the attL sites 55 
bases from the cloning junction, and allows for the analysis of the entire 
cloned insert with a 100 base "read" before the "downstream" atiLZ site. 

RNAi by transient Transfections 

[0272] Post-transcriptional inhibition of luciferase (GL2) and lacZ expression 

was evident upon expression of shRNA targets from the pENTR/U6 vector 
(Figure 3A). Specific inhibition is evident with pENTR/U6 shRNA clones 
targeting Luciferase and lacZ expression from co-transfected reporter 
constructs. The Luciferase pENTR/U6 GL2-22 construct inhibits expression 
of GL2 Luciferase but not lacZ (Figure 3A); similarly, the pENTR/U6 with 
the lacZ-19 shRNA target sequence (the target provided as a control in this 
kit) inhibits lacZ expression from pcDNA1.2/V5-GW//acZ (the control 
expression vector for this kit) - but not Luciferase (Figure 3B). 

[0273] Similar inhibition of both lacZ and Luciferase is evident with shRNAs 

that target different sites, although not all shRNA sequences are effective 
(Figures 4A and 4B). The kit control lacZ-19 target site presented in 
Figure 4B is the same shRNA target site used in Figure 3B, and only the 
lacZ4-AS sequence inhibits expression to the same degree. The lacZ4-SA 
only moderately inhibits expression and the lacZ5 clones have little if any 
inhibitory effect. Similarly, the GL2sh2 and GL2-22 (AS) target sites are the 
most effective shRNA clones tested at inhibiting luciferase expression 
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(Figures 4A and 4B). Interestingly, the sense to anti-sense orientation of the 
shRNA target sequence can make a considerable difference in the level of 
inhibition at a specific target (Figures 4A and 4B). However, the optimal 
orientation (sense-loop-antisense (SA) or antisense-loop-sense (AS)) is not 
clear; with Luciferase, the AS orientation was most effective, but with lacZ the 
SA orientation was most effective (Figure 4A, ENTR/U6-A6-GL2-22 AS vs. 
SA, and Figure 4B, ENTR/U6-A6-lacZ4-AS vs. SA). 
[0274] Additionally, the lacZ-19 shRNA target sequence was tested in 

derivatives of the pENTR/U6 vector with terminators of 4-8 Ts. All the 
terminators behaved similarly (Figure 5). 

Gateway LxR cross 

[0275] Any shRNA target sequence cloned into pENTR/U6 can easily be 

transferred as a U6 RNAi cassette to a Gateway DEST vector by attL x attR 
(LxR) recombination at the att sites. Following is a demonstration of the 
efficiency of LxR transfer. The lacZ-19 target sequence cloned into 
pENTR/U6 was transferred into pLenti6/PL-DEST and pAD/PL-DEST by a 
standard LxR Clonase catalyzed recombination reaction (See, e.g., Figs. 38 
and 39) as described previously (See U.S. Patent Nos. 5,888,732; 6,143,577; 
6,171,861; 6,277,608; and 6,720,140; the disclosures of which are 
incorporated by reference herein in their entireties). Additionally, 12 different 
pENTR/U6 shRNA target subclones, including target sequences to Lamin AC 
and Luciferase, were also recombined into these two DEST vectors. In all 
cases, the LxR crosses were efficient. When 2|ul/20jal LxR reaction were 
transformed and l/6th (50pl) of the transformation reaction plated, 300-800 
colonies/plate were obtained in ToplO cells. Even in HB101 cells that were 
-40 fold less competent to take up DNA than the ToplO cells, 10-20 
colonies/plate could be obtained by plating more of the transformation 
reaction (lOO^il vs. 50pl). Note that the number of clones obtained are similar 
between the Lenti DEST and the Adeno DEST vectors, even though the 
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Adenoviral vector is almost 4 times the size of the Lentiviral vector (~36kb vs. 
~8.6kb). 

[0276] The LxR crosses were not only efficient but also effective. Ten out of 

ten of the Adeno DEST vector recombinants had the correct RNAi cassette as 
determined by RFLP analysis. pLenti DEST recombinants were transformed 
into both ToplO and HB101 E. coli cells because HB 101 cells are known for 
reducing the recombination between the lentiviral LTR sequences. In this 
case, 10/10 recombinants were correct using HB101 cells. 

shRNA Target Site Selection 

[0277] The present invention may be used to create shRNAs with any desired 

stem length, orientation, and loop sequence. In general, target sequences 
should be complex (no runs of more than 3 of the same nucleotide), with low 
GC content (30-50%), and avoid known RNA-protein interaction sites. Target 
sites should be a minimum of 19nt, and sites of up to 29nt are effective. 

DNA Oligo Insert Design 

[0278] Once a candidate target site has been selected, it must be converted 

into an shRNA sequence, and the DNA oligos ordered for cloning into 
pENTR/U6. The shRNA sequence can be in two possible orientations. Either 
the sense target site or the antisense sequence of the target site can begin the 
shRNA, followed by a short loop sequence and then the Opposite strand of the 
target site. 

[0279] The fact that the polymerase (pol HQ will terminate transcription after 

4 thymidines (Ts) constrains the oligo design. Strings of more than 3 Ts 
should be avoided in the middle of a target site, or with any Ts in the 
connecting "loop", to prevent early termination. Additionally, Ts at the 3' end 
of the target will abut the polyT terminator and may cause slightly premature 
termination. Changing the sense/antisense orientation of the shRNA may be 
necessary for specific target sites to avoid early pol IE termination by 
positioning different sequences next to the loop or polyT terminator. 
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[0280] Additionally, the native U6 snRNA initiates at a guanosine (G), and 

this -l-l base is believed to be important. Although this system allows 
advanced users to choose any +1 base, we have designed all of our inserts to 
initiate at a G. In cases where the G is part of the target sequence, it is simply 
incorporated into the stem, with a complementary cytosine base placed just 
before the terminator. When G is not the first base in the sense or antisense 
target sequence, it is added to the 5' end of the shRNA with no 
complementary base at the 3 5 end. If use of a G is not desired, an A is 
believed to be better than an C or T. 

[0281] Functional loops of anywhere from 4 to 1 lnt have been reported in the 

literature. Short loops are preferred as they reduce the lengths of the oligos 
needed for cloning. 5'-TTCG, 5'-AACG, and 5'CGAA have been used as the 
loop sequences in successful RNAi constructs. However, loops containing 
thymidines must be avoided in some cases as they may cause early 
termination, as discussed above. 

[0282] Finally, to convert an shRNA sequence into an oligo pair for insertion, 

5'CACC-3' was added to the 5' end of the shRNA sequence to create the 
"top" oligo. The "bottom" oligo is the complimentary sequence of the top 
oligo with the 5'CACC-3' removed and 5'AAAA-3' appended to the 5' end. 

Conclusion 

[0283] The pENTR/U6 and Gateway DEST vectors are the cornerstones of a 

superior system to clone shRNA target sequences into an RNAi expression 
cassette and deliver it to cells (Figure 28). Two other commercial sources 
with similar pol HI vectors (Ambion with pSilencer, and OligoEngines with 
pSuper) require the synthesis of longer insert oligos (~70nt and 55nt 
respectively) because their cloning schemes need the end of the U6 promoter 
and termination signals to be "built-back" with the insert. Additionally, their 
cloning protocols call for ligation incubations of lhr or greater compared to 
the 5 min bench-top reaction described here. This is likely due to the PEG 
present in the present ligation buffer, as well as the present vector design 
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features that eliminate background (the ccdB negative selection and the non- 
compatible ends left after Bsal digestion). The present invention also has the 
Gateway Advantage; any insert cloned and sequence verified in pENTR/U6 is 
then available for any application made possible by the DEST vectors - such 
as viral delivery of shKNA by VirapowerTM. 
[0284] The demonstrations of RNAi in transient txansfections reported here, as 

well as examples of successful RNAi by transduction indicate the U6 
promoter can generate sufficient shRNA for RNAi. Experiments that define 
the rules required for efficient RNAi will make Ijhis vector all the more 
valuable. 



Example 2 

Expression of Interfering RNA using a Seamless Cloning Vector 
Abstract and Introduction 



[0285] Short hairpin RNA (shRNA) expression cassettes built into the U6 

RNAi Entry Vector can be used to transiently knockdown genes of interest in 
cell culture. However, the Entry Vector carries no marker for selection in 
mammalian cells, and the plasmids must be introduced into cells by 
transfection. Transfection efficiency varies widely between cell lines and is 
ineffective in primary and terminally differentiated cells. In contrast to 
plasmid transfection, lentiviral delivery allows simple, stable transduction of a 
wide variety of cell types including primary and terminally differentiated cells. 
A number of recent publications describe the use of lentiviruses to deliver 
shRNAs to mammalian cells (Abbas-Terki et al 2002, Dirac & Bernards 
2003, Matta et al 2003, Qin et al 2003, Rubinson et al 2003, Stewart et al 
2003, Tiscornia et al 2003), demonstrating an existing interest in this 
technique. 
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[0286] Invitrogen offers several Gateway-adapted lentiviral vectors for 

cloning of coding sequences downstream of a Pol II promoter. However, the 
presence of such an upstream promoter may interfere with Pol in expression 
from a U6 cassette. A promoterless Destination vector, pLenti6/RNAi-DEST 
has been created with atiRl and attR2 sites compatible with the U6 RNAi 
Entry Vector. A map of pLenti6/RNAi-DEST is shown in Figure 6A. 
pLenti6/RNAi-DEST allows simple and reliable transfer of shRNA expression 
cassettes into the lentiviral backbone. The viral vector confers blasticidin 
resistance for selection of stably transduced cells. Transduction by 
lentiviruses expressing lamin A/C shRNAs is demonstrated to efficiently and 
specifically knock down endogenous protein levels. pLenti6/RNAi-DEST 
complements the ViraPower™ product line and provides a powerful new 
application for the U6 RNAi Entry Vector. 

[0287] Key Performance Criteria for Lenti6/RNAi-DEST include: (1) 

pLenti6/RNAi-DEST passing standard manufacturing QC specs for 
Destination vectors. (2) Gateway cloning shRNAs into pLenti6/RNAi-DEST 
and packaging virus at levels comparable with regular vectors. (3) Showing 
specific knockdown of endogenous lamin A/C gene. 

Materials and Methods 

[0288] Construction of pLenti6/RNAi-DEST Vector Lenti6/RNAi-DEST is 

the product of a Gateway BxP reaction between pLenti6/PUattB4/V5/GW~ 
GFP and pDONR 221. The BxP reaction was transformed into DB3.1 and 
selected on LB media containing Ampicillin (100 \xg/ml) and chloramphenicol 
(15 yg/ml). Colonies of the transfonnants were analyzed by restriction digest. 
A map of pLenti6/RNAi DEST is shown in Figure 6A. 

ShRNA-containing Entry Cones 

[0289] The various shRNA-containing Entry clones used are set out in 

Table 1. The hairpins are targeted to sites on the lamin A/C or luciferase 
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genes as indicated. All entry clones were created by oligo cloning into 



pENTR/U6.2. Loops and stems choices are described in Example 1. 



Table 


I. pENTR/U6 Entry Clones 


Clone name 


Target gene 


Orientation a 


Loop 
sequence 


Stem length* 
(bp) 


Target 
position 0 (nt) 


pENTR/U6- 
lamAC-SA-uucg 


lamin AJC 


SA 


UUCG 


19 


610-628 


pENTR/U6- 
lamAC-AS-uucg 


lamin AJC 


AS 


UUCG 


19 


610-628 


pENTR/U6- 
lamAC-AS-cgaa 


lamin A/C 


AS 


CGAA 


19 


610-628 


pENTR/U6- 
lamAC-SA-cgaa 


lamin AJC . 


SA 


CGAA 


19 


610-628 ' 


pENTR/U6-GL2- 
22 


luciferase 


AS 


UUCG 


22 


153-174 


pEMTR/U6- 
GL2sh2 d 


lucifexase 


AS 


GAACGT 
TG 


29 


1355-1383 



a Orientations are either sense-loop-antisense (SA) or antisense-loop-sense (AS). 
b Stem length does not include +1 G base if it is not also part of the target site, 
target position is relative to start codon. 

hairpin design based on a previously assessed technology from Cold Spring Harbor 
Laboratories. 



Destination Vector QC and generation of expression control vector 

[0290] pLenti6/RNAi-DEST was monitored for quality using' the official 

"Dest Vector QC Procedure" established by manufacturing. The expression 
control plasmid, pLenti6/RNAi/U6-GW/lamAC was generated by a standard 
Gateway LxR reaction between pLenti6/RNAi-DEST and pENTR/U6- 
lamAC-AS-cgaa. Clones of pLenti6/RNAi/U6-GW/lamAC were confirmed 
by restriction analyses. A map of pLenti6/RNAi/U6-GW/lamAC is shown in 
Figure 6B. 

Cell culture 



[0291] 293FT cells were cultured in DMEM/10% FBS/L-glutamine/non- 

essential amino acids/penicillin/streptomycin containing 500 jig/ml G418. 
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HeLa cells were cultured in DMEM/10% FBS/L-glutamine/non-essential 
amino acids/penicillin/streptomycin. 

Virus production 

[0292] For virus production, 1 x 10 7 293FT cells were plated per T175 flask. 

Twenty-four hours later, culture medium was replaced with 20 ml 
OptiMem/10%FBS, and shRNA-encoding viruses were packaged by co- 
transfecting the 293FT cells with the respective lentiviral vector and pLPl, 
pLP2 and pLP/VSVG (at a mass ratio of 1:1:1:1, 24 jag of total DNA) as 
follows: The 24 jag DNA was mixed with 3 ml of OptiMem media. In a 
separate tube, 72 fxl of Lipofectamine 2000 was also mixed with 3 ml of 
OptiMem media. After a 5-minute incubation period at room temperature, the 
two mixtures were combined and incubated at room temperature for an 
additional 20 minutes. At the completion of the incubation period, the 
transfection mixture was added to the cells dropwise and the flask was gently 
rocked to mix. The following day the transfection complex was replaced with 
30 ml complete media (DMEM, 10% FBS, 1% penicillin/streptomycin, L- 
glutamine and non-essential amino acids). Virus-containing media were 
harvested at day 2 and day 3 post-transfection, centrifuged at 3000 rpm for 5 
minutes to remove dead cells, and filtered through sterile 0.45 micron 
cellulose acetate filters to remove fine debris. Viruses in the filtrates were 
concentrated by ultracentrifugation (90 minutes, 23000xg, 4°C). Viral pellets 
from ultra-centrifugation were resuspended in 500-600 \xl growth media. One 
hundred-microliter aliquots of concentrated virus were stored in -80°C freezer 
until use. 

Viral Titering and Transduction 

[0293] All applications of virus to cells were performed in the presence of 6 

^g/ml polybrene (Sigma, hexadimethrin bromide, #H9268) and media changes 
were performed 12-24 hours post transduction. For titering virus, 6-well 
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plates were seeded with 2 x 10 5 HT1080 cells per well the day before 
transduction. One milliliter each of ten-fold serial dilutions of viral 
supernatant ranging from 1CF 2 to 10" 8 was prepared. All dilutions were mixed 
by gentle inversion prior to adding to cells. Mock-transduced cells had no 
virus added to them. Plates were gently swirled to mix. The following day, 
the media was replaced with complete media. Forty-eight hours post- 
transduction, the cells were placed under 10 ng/ml blasticidin selection. After 
7 to 10 days of blasticidin selection the resulting colonies were stained with 
crystal violet : A 1% crystal violet solution was prepared in 10% ethanol. 
Each well was washed with 2 ml PBS followed by 1 ml of crystal violet 
solution for 10 minutes at room temperature. Excess stain was removed by 
two 2 ml PBS washes and colonies visible to the naked eye were counted to 
determine the viral titer of the original supernatants. 
[0294] Transductions to test shRNA activities were performed in the 

appropriate cells in 12-well plates. Cells were plated at 1 x 10 5 /well twenty- 
four hours before transduction. The next day, the media was replaced with 
complete media. Transduction was conducted in a final volume of 500 pi and 
contained the appropriate volumes of virus supernatant to achieve a range of 
MOIs. 

Cell lysis and Western Blot 

[0295] Cell lysis for lamin A/C and beta-actin western blots were performed 

as follows: Forty-eight or 120 hours post-transduction, cells were harvested 
with Versene (Invitrogen), transferred to microfuge tubes, and centrifuged at 
3000 RPM for 4min. Pellets were lysed in 2X NuPAGE® LDS Sample 
Buffer with IX Sample Reducing Agent and denatured at 95°C for 5 min prior 
to electrophoresis. Protein samples were electrophoresed on NuPAGE® 
Novex 4-12% Tris-Bis Gels in IX MOPS-SDS buffer with NuPAGE® 
Antioxidant in the upper chamber. Western blot analyses were performed 
using the Western Breeze Immunodetection Kit (Invitrogen) according to the 
manufacturer's protocol. Lamin A/C and beta-actin proteins were detected 
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using 1:1000 monoclonal anti-lamin A/C (BD Biosciences) and 1:5000 
monoclonal anti-beta-actin (Abeam) antibodies, respectively. 

Results and Discussion 

[0296] Destination Vector QC pLenti6/RNAi-DEST passed the standard 

manufacturing QC specs for Destination vectors with respect to total colony 
count (Table 2) and ccdB assay (Table 3). 

Virus Titers 

[0297] ShRNA-encoding lentiviral vectors were used to produce virus in 

293FT cells. The vectors produced viral titers comparable to titers attained 
with regular lentiviral vectors that do not contain shRNA (Table 4). This 
indicated that introduction of shRNAs into the lentiviral backbone does not 
compromise virus packaging or transduction efficiency. 



Table 4. Lenti6/RNAi Virus Titers 



Virus 


Crude Virus 
Titer (cfu/ml) 


Concentrated 
Virus Titer 
(cfu/ml) a 


Lenti6/RNAi/U6-GW/lamAC-SA-uucg 


1.00E+6 


4.30E+08 


Lenti6/RNAi/U6.2-GW/lamAC-AS-uucg 


2.10E+6 


5.85E+08 


Lenti6/RNAiAJ6.2-GW/amAC-AS-cgaa 


8.00E+5 


1.35E+08 


Lenti6/RNAi/U6.2-GW/lamAC-SA-cgaa 


1.20E+6 


4.45E+08 


Lenti6/RNAi/U6-GW/GL2-22 


6.00E+5 


4.50E+08 


Lenti6RNAi/U6-GW/GL2sh2 


1.30E+6 


5.20E+08 


Lenu6/V5-GW/GFP(non-RNAi virus) 


4.00E+5 


8. OE+07 



Concentrated from two 175cm 2 flasks each. 



Knockdown of Lamin A/C 

[0298] Lentiviruses were tested for their ability to deliver shRNAs to 

specifically knock down lamin A/C expression in HeLa cells. Lentiviruses 
expressing luciferase-targeted shRNAs served as negative controls. Inhibition 
of lamin A/C expression was analyzed by western blot. ShRNAs targeted to 
lamin inhibited expression of both lamin A and C isoforms 48hr and 5 days 
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post-transduction (Figure 7). The extent of inhibition depended on transduced 
MOI, indicating knockdown was dose-dependent. Lentiviruses encoding 
shRNAs lamAC-AS-cgaa and lamAC-SA-cgaa provided the best lamin 
knockdowns (Figure 7, top panel lanes 11-16; bottom panel lanes 14-19). Of 
the two shRNAs, lamAC-AS-cgaa mediated robust inhibition even at the 
relatively low MOI of 14 (Figure 7, top panel lane 11 and bottom panel lane 
14). The lamin A/C shRNAs had no effect on beta-actin expression 
irrespective of transduced MOI (Figure 7, beta-actin blots). Control luciferase 
shRNAs had no effect on beta-actin expression (Figure 7, top panel lanes 7-9 
and 17-19; bottom panel lanes 1-3 and 11-13) and minor effect on lamin A/C 
expression even at the very high MOI of 520 (Figure 7, top panel lane 19; 
bottom panel lane 13). These results show specific inhibition of lamin 
expression with lamin-targeted shRNAs. The inhibition is not the effect of 
general inhibition of gene expression. Results of the control shRNA 
transduction provide further evidence of the specific activity of the lamin- 
directed shRNAs. 

[0299] pLenti6/RNAi has also been used to specifically knock down 

luciferase (75% inhibition, 48 hrs post-transduction in Flp-In 293 luc cell line; 
data not shown) and lacZ at high MOIs (55% inhibition, 96 hrs post- 
transduction in HT1080LacZ cells; data not shown). These provide further 
evidence that pLenti6/RNAi-DEST vector will function with other RNAi 
cassettes. 

Summary 

[0300] Gateway-adapted lentiviral vector pLenti6/RNAi-DEST has been 

developed for RNAi analyses. pLenti6/RNAi-DEST is designed to be used in 
LxR reactions with pENTR/U6. pLenti6/RNAi-DEST meets the performance 
criteria for all DEST vectors as well as criteria for packaging and transducing 
lentiviruses. Viruses Lenti6/RNAi/U6-GW/lamAC-AS-cgaa and 
Lenti6/RNAiAJ6-GW/lamAC-SA-cgaa transduce shRNAs that specifically 
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knock down lamin A/C expression. The lamAC-AS-cgaa hairpin was chosen 
as the positive control for the U6 RNAi Entry and pLenti6/RNAi Kits. The 
sequence of lamAC-AS-cgaa hairpin is shown in the Kit Components and 
Configuration below. 

Example 3 
RNAi using BLOCK-iT™ Dicer Kit 

[0301] BLOCK-iT™ Kits (Invitrogen Corporation; Carlsbad, CA) can be used 

for fast and efficient RNAi applications. Eukaryotic cells naturally regulate 
gene expression with dsRNA. A BLOCK-iT™ Dicer Kit can be used to 
generate dsRNA that are then diced into siRNA, purified and transfected into 
cells. The BLOCK-iT™ Dicer Kit requires no expensive synthetic siRNAs. It 
also produces a pool of many siRNAs per gene, not just one or a few, which 
means a higher probability of knockdown (Figure 21,22, and 23). A 
purification procedure gives a high yield of siRNAs in a transfection-ready 
buffer and virtually eliminates remaining long dsRNA and cleave 
intermediates. 

[0302] BLOCK-iT™ Long RNAi Transcription Kits use a T7 TOPO linker 

which allows any polymerase chain reaction (PCR) product to become a 
template for transcription (Figure 20). This mediates RNAi in invertebrates 

(e.g., insects, nematodes and protozoans), some mammalian embryonic cells 

> 

(undifferentiated ES cells) and many mammalian cell lines after treatment 

with Dicer/RNase IE, BLOCK-iT™ Kits allows for an inexpensive 

alternative to siRNA oligos. Exemplary uses of BLOCK-iT™ Kits are 

summarized in Figure 24. 

Kit Components and Configurations 
Complete Lentiviral RNAi Kit: 
Components of the U6 RNAi Entry Vector Kit: 
[0303] Purified, Bsal-linearized pENTR/U6.2; Annealed lamin A/C control 

oligos: Top 5'-CACCGTGTTCTTCTGGAAGTCCAGCGAACT 
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GGACTTCCAGAAGAACA (SEQ ID NO:9), Bottom 5'- 
AAAATGTTCTTCTGGA 

AGTCCAGTTCGCTGGACTTCCAGAAGAACAC (SEQ ID NO: 10); 
Sequencing primers: U6 forward 5 ' -GGACTATC AT ATGCTTACCG (SEQ 
ID NO:ll), M13 reverse 5' J CAGGAAACAGCTATGAC (SEQ ID 
NO:12)(Catalog No. N530-02, Invitrogen Corp., Carlsbad, CA); T4 DNA 
ligase (Catalog No. 15224-025, Invitrogen Corp., Carlsbad, CA); 5X T4 DNA 
ligase buffer (Catalog No. Y90001, Invitrogen Corp., Carlsbad, CA Y90001); 
OneShot ToplO cells (Catalog No. C4040-03, Invitrogen Corp., Carlsbad, 
CA); pLenti67RNAi/DEST; pLenti6/RNAi/U6-GW/lamAC; OneShot STBL3 
cells; Virapower Bsd Lentiviral Support Kit (Catalog No. K4970-00, 
Invitrogen Corp., Carlsbad, CA); Gateway LR Clonase enzyme mix (Catalog 
No. 11791-091, Invitrogen Corp., Carlsbad, CA). 

Lentiviral RNAi DEST Kit 

[0304] pLenti6/RNAi/DEST; pLenti6/RNAi/U6-GW/lamAC; OneShot 

STBL3 cells; Gateway LR Clonase en2yme mix (Catalog No. 11791-019, 
Invitrogen Corp., Carlsbad, CA) 
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Matta et al, Use of lentiviral vectors for delivery of small interfering RNA. 
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Qin et al, Inhibiting HTV-1 infection in human T cells by lentiviral-mediated 
delivery of small interfering RNA against CCR5. Proc. Natl Acad. Sci. 
(USA) i 00:183-188 (2003) 
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Stewart et al, Lentivirus-delivered stable gene silencing by RNAi in primary 
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(USA) 700:1844-1848 (2003) 



Table 2. LxR Assay 



Sample fv' 


I \ Criteria 


■*.£, Values^ V 

ft** . * " ".» 


Pass/F^ail 


Cells only 


0 cfu/ng DNA 


0 cfu/^g DNA 


Pass \ 


No DNA 


0 cfu/^g DNA 


0 cfu/^g DNA 


Pass 


DEST vector only 


<1100cfu/|ig DNA 


660 cfu/fig DNA 


Pass 


LxR Reaction (n = 2) 


> 1 .65 x 1 0 6 cf u/^g DNA 


2.31 x 10 6 cfu/^ig DNA 


Pass 


pUC19only (n = 2) 


>7.5x10 8 cfu/^ig DNA 


2.53x10 10 cfu/ng DNA 


Pass 



Table 3. ccdB Assay 



Sample 


Cell Type 


Antibiotic 


Transformation Efficiency 


Cells Only 


DB3.1 


Amp 


OcfiiAxgDNA 






Kan 


0 cfuAxgDNA 


pUC19only(n=4) 


DB3.1 


Amp 


7.0X106 cfu/|xgDNA 


DEST vector only 
(n=4) 


DB3.1 


Amp 


3.0 X106cfu/tigDNA 


Cells Only 


TOP10 


Amp 


Ocfu/^igDNA 






Kan 


Ocfu/iigDNA 


pUC19only(n=4) 


TOP10 


Amp 


2.65X108 cfoAigDNA 


DEST vector only 
(n=4) 


TOP10 


Amp 


5.75 X 103 cfu/figDNA 






Kan 


Ocfu/ngDNA 


Fold-killing (criteria = 
1 x 104) 






2 x 104 Pass 



[0305] The invention illustratively described herein suitably may be practiced 

in the absence of any element or elements, limitation or limitations which is 
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not specifically disclosed herein. Thus, for example, in each instance herein 
any of the terms "comprising," "consisting essentially of," and "consisting of 
may be replaced with either of the other two terms. The terms and expressions 
that have been employed are used as terms of description and not of limitation, 
and there is no intention that in the use of such terms and expressions of 
excluding any equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. Thus, it should be understood that although 
the present invention has been specifically disclosed herein, optional features, 
modification and variation of the concepts herein disclosed may be resorted to 
by those skilled in the art, and that such modifications and variations are 
considered to be within the scope of this invention as defined by the appended 
claims. In addition, where features or aspects of the invention are described in 
terms of Markush groups, those skilled in the art will recognize that the 
invention is also thereby described in terms of any individual member or 
subgroup of members of the Markush group. 

[0306] The invention has been described broadly and generically herein. Each 

of the narrower species and subgeneric groupings falling within the generic 
disclosure also form part of the invention. This includes the generic 
description of the invention with a proviso or negative limitation removing 
any subject matter from the genus, regardless of whether or not the excised 
material is specifically recited herein. Other aspects of the invention are 
within the following claims. 

[0307] All publications, patents and patent applications mentioned in this 

specification are indicative of the level of skill of those skilled in the art to 
which this invention pertains, and are herein incorporated by reference to the 
same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 
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Table 5: pENTRU6 Vector Nucleic Acid Sequence 



CTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCT 

TTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCA 

GCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAAC 

CGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGA 

CAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAA 

TACGCGTACCGCTAGCCAGGAAGAGTTTGTAGAAACGCAAAAAGG 

CCATCCGTCAGGATGGCCTTCTGCTTAGTTTGATGCCTGGCAGTTTA 

TGGCGGGCGTCCTGCCCGCCACCCTCCGGGCCGTTGCTTCACAACG 

TTCAAATCCGCTCCCGGCGGATTTGTCCTACTCAGGAGAGCGTTCA 

CCGACAAACAACAGATAAAACGAAAGGCCCAGTCTTCCGACTGAG 

CCTTTCGTTTTATTTGATGCCTGGCAGTTCCCTACTCTCGCGTTAAC 1 

GCTAGCATGGATGTTTTCCCAGTCACGACGTTGTAAAACGACGGCC 

AGTCTTAAGCTCGGGCCCCAAATAATGATTTTATTTTGACTGATAGT 

GACCTGTTCGTTGCAACAAATTGATGAGCAATGCTTTTTTATAATGC 

CAACTTTGTACAAAAAAGCAGGCTTTAAAGGAACCAATTCAGTCGA 

CTGGATCCGGTACCAAGGTCGGGCAGGAAGAGGGCCTATTTCCCAT 

GATTCCTTCATATTTGCATATACGATACAAGGCTGTTAGAGAGATA 

ATTAGAATTAATTTGACTGTAAACACAAAGATATTAGTACAAAATA 

CGTGACGTAGAAAGTAATAATTTCTTGGGTAGTTTGCAGTTTTAAA 

ATTATGTTTTAAAATGGACTATCATATGCTTACCGTAACTTGAAAGT 

ATTTCGATTTCTTGGCTTTATATATCTTGTGGAAAGGACGAAACACC 

GGAGACCGCGGCCGCTGGATCCGGCTTACTAAAAGCCAGATAACA 

GTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTG 

ATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCA 

GCGTATTACAGTGACAGTTGACAGCGACAGCTATCAGTTGCTCAAG 

GCATATATGATGTCAATATCTCCGGTCTGGTAAGCACAACCATGCA 

GAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAA 

TCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGG 

CTCTTTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGT 

TTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTA 

CAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCCC 

TGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTA 

CCCGGTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCAC 

CGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGCT 

GATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTG 

ATGTTCTGGGGAATATAAGGTCTCATTTTTTTTCTAGACCCAGCTTT 

CITGTACAAAGTTGGCATTATAAGAAAGCATTGCTTATCAATTTGTT 

GCAACGAACAGGTCACTATCAGTCAAAATAAAATCATTATTTGCCA 

TCCAGCTGATATCCCCTATAGTGAGTCGTATTACATGGTCATAGCT 

GTTTCCTGGCAGCTCTGGCCCGTGTCTCAAAATCTCTGATGTTACAT 

TGCACAAGATAAAAATATATCATCATGAACAATAAAACTGTCTGCT 

TACATAAACAGTAA 
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Table 5 (continued): pENTRU6 Vector Nucleic Acid Sequence 

tacaaggggtgttatgagccatattcaacgggaaacgtcgagg.cc 

gcgattaaattccaacatggatgctgatttatatgggtataaatgg 

gctcgcgataatgtcgggcaatcaggtgcgacaatctatcgcttgt 

atgggaagcccgatgcgccagagttgtttctgaaacatggcaaag 

gtagcgttgccaatgatgttacagatgagatggtcagactaaactg 

gctgacggaatttatgcctcttccgaccatcaagcattttatccgta 

ctcctgatgatgcatggttactcaccactgcgatccccggaaaaac 

agcattccaggtattagaagaatatcctgattcaggtgaaaatatt 

gttgatgcgctggcagtgttcctgcgccggttgcattcgattcctgt 

ttgtaattgtccttttaacagcgatcgcgtatttcgtctcgctcagg 

cgcaatcacgaatgaataacggtttggttgatgcgagtgatttgat 

gacgagcgtaatggctggcctgttgaacaagtctggaaagaaatg 

cataaacttttgccattctcaccggattcagtcgtcactcatggtga 

tttctcacttgataaccttatttttgacgaggggaaattaataggtt 

gtattgatgttggacgagtcggaatcgcagaccgataccaggatct 

tgccatcctatggaactgcctcggtgagttttctccttcattacaga 

aacggctttttcaaaaatatggtattgataatcctgatatgaataa 

attgcagtttcatttgatgctcgatgagtttttctaatcagaattgg 

ttaattggttgtaacactggcagagcattacgctgacttgacggga 

cggcgcaagctcatgaccaaaatcccttaacgtgagttacgcgtcg 

ttccactgagcgtcagaccccgtagaaaagatcaaaggatcttctt 

gagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaa 

accaccgctaccagcggtggtttgtttgccggatcaagagctacca 

actctttttccgaaggtaactggcttcagcagagcgcagataccaa 

atactgtccttctagtgtagccgtagttaggccaccacttcaagaa 

ctctgtagcaccgcctacatacctcgctctgctaatcctgttaccag 

tggctgctgccagtggcgataagtcgtgtcttaccgggttggactc 

aagacgatagttaccggataaggcgcagcggtcgggctgaacgg 

ggggttcgtgcacacagcccagcttggagcgaacgacctAcaccg 

aactgagatacctacagcgtgagcattgagaaagcgccacgcttcc 

cgaagggagaaaggcggacaggtatccggtaagcggcagggtcgg 

aacaggagagcgcacgagggagcttccagggggaaacgcctggta 

tctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgat 

ttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccag 

caacgcggcctttttacggttcctggccttttgctggccttttgctc 

acatgtt seqidno:! 
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Table 6: Nucleotide sequence of plasmid pLenti6W5-DEST. 

AATGTAGTCTTATGCAATACTCTTGTAGTCTTGCAACATGGTAACG 

ATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCA 

TGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAA 

GGCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTGC 

CGCATTGCAGAGATATTGTATTTAAGTGCCTAGCTCGATACATAAA 

CGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAG 

TGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAG 

AGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTG 

GCGCCCGAACAGGGACTTGAAAGCGAAAGGGAAACCAGAGGAGCT 

CTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACGGCAAGAGG 

CGAGGGGCGGCGACTGGTGAGTACGCCAAAAATTTTGACTAGCGG 

AGGCTAGAAGGAGAGAGATGGGTGCGAGAGCGTCAGTATTAAGCG 

GGGGAGAATTAGATCGCGATGGGAAAAAATTCGGTTAAGGCCAGG 

GGGAAAGAAAAAATATAAATTAAAACATATAGTATGGGCAAGCAG 

GGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCA 

GAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAG 

ACAGGATCAGAAGAACTTAGATCATTATATAATACAGTAGCAACCC 

TCTATTGTGTGCATCAAAGGATAGAGATAAAAGACACCAAGGAAG 

CTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGACCACCG 

CACAGCAAGCGGCCGCTGATCTTCAGACCTGGAGGAGGAGATATG 

AGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTAGTAAAA 

ATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTG 

GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 

GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATG 

ACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGC 

AGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTT 

GCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCT 

GGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTG 

GGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 

GCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATCACACGA 

CCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAA 

TACACTCCTTAATTGAAGAATCGCAAAACCAGCAAGAAAAGAATG 

AAC AAGAATT ATT GGAATT AGAT AAAT GGGCAAGTTTGTGGAATTG 

GTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATG 

ATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTACTTTC 

TATAGTGAATAGAGTTAGGCAGGGATATTCACCATTATCGTTTCAG 

ACCCACCTCCCAACCCCGAGGGGACCCGACAGGCCCGAAGGAATA 

GAAGAA.GAAGGTGGAGAGAGAGACAGAGACAGATCCATTCGATTA 

GTGAACGGATCTCGACGGTATCGATAAGCTTGGGAGTTCCGCGTTA 
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Table 6 (continued). 

Nucleotide sequence of plasmid pLenti6/V5-DEST. 

CATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCC 

CCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 

ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAA 

CTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCC 

CCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCC 

CAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGT 

ATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATC 

AATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 

ACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACG 

GKjACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATG 

GGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTT 

TAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGA 

CCTCCATAGAAGACACCGACTCTAGAGGATCCACTAGTCCAGTGTG 

GTGGAATTCTGCAGATATCAACAAGTTTGTACAAAAAAGCTGAACG 

AGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTG 

CATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAG 

TCACTATGGCGGCCGCATTAGGCACCCCAGGCTTTACACTTTATGC 

TTCCGGCTCGTATAATGTGTGGATTTTGAGTTAGGATCCGGCGAGA 

TTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGA 

TATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTG 

AGGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCA 

GCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCAC 

AAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGC 

TCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATA 

TGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTG 

AAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGCA 

GTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAAC 

CTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTC 

AGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCC 

AATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATT 

ATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCA 

TCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAA 

TTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAAGATCT 

GGATCCGGCTTACTAAAAGCCAGATAACAGTATGCGTATTTGCGCG 

CTGATTTTTGCGGTATAAGAATATATACTGATATGTATACCCGAAG 

TATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACA 

GTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAA 

TATCTCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGT 

CTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGGCT 

GAGGTCGCCCGGTTTATTGAAATGAACG 
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Table 6 (continued). Nucleotide sequence of plasmid pLenti6/V5-DEST. 

GCTCTTTTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGG 

TTTACACCTATAAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGT 

ACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTGATCCCC 

CTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTT 

ACCCGGTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCA 

CCGATATGGCCAGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGC 

TGATCTCAGCCACCGCGAAAATGACATCAAAAACGCCATTAACCTG 

ATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCAGT 

CTGCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATT 

ATGTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTT 

ATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTGA 

TATCCAGCACAGTGGCGGCCGCTCGAGTCTAGAGGGCCCGCGGTTC 

GAAGGTAAGCCTATCCCTAACCCTCTCCTCGGTCTCGATTCTACGC 

GTACCGGTTAGTAATGAGTTTGGAATTAATTCTGTGGAATGTGTGT 

CAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGGCAGGCAGAAG 

TATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAG 

TCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCA 

ATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCC 

CCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAA 

TTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCA 

AAAAGCTCCCGGGAGCTTGTATATCCATTTTCGGATCTGATCAGCA 

CGTGTTGACAATTAATCATCGGCATAGTATATCGGCATAGTATAAT 

ACGACAAGGTGAGGAACTAAACCATGGCCAAGCCTTTGTCTCAAG 

AAGAATCCACCCTCATTGAAAGAGCAACGGCTACAATCAACAGCA 

TCCCCATCTCTGAAGACTACAGCGTCGCCAGCGCAGCTCTCTCTAG 

CGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGGG 

GGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGCTGCTGCTGCGG 

CAGCTGGCAACCTGACTTGTATCGTCGCGATCGGAAATGAGAACAG 

GGGCATCTTGAGCCCCTGCGGACGGTGCCGACAGGTGCTTCTGGAT 

CTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGGACAG 

CCGACGGCAGTTGGGATTCGTGAATTGCTGCCCTCTGGTTATGTGT 

GGGAGGGCTAAGCACAATTCGAGCTCGGTACCTTTAAGACCAATG 

ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGG 

GGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATCTGC 

TTTTTGCTTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGG 

GAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAA 

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGAC 

TCT 
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Table 6 (continued). Nucleotide sequence of plasmid pLenti6/V5-DEST. 

GGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCT 

CTAGCAGTAGTAGTTCATGTCATCTTATTATTCAGTATTTATAACTT 

GCAAAGAAATGAATATCAGAGAGTGAGAGGAACTTGTTTATTGCA 

GCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAA 

ATCAATGTATCTTATCATGTCTGGCTCTAGCTATCCCGCCCCTAACT 
CCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCC 

CGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCITrTTTGGAGG 

CCTAGGGACGTACCCAATTCGCCCTATAGTGAGTCGTATTACGCGC 

GCTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGG 

CGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCT 

GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTT 

GCGCAGCCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGCATT 

AAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTT 

GCCAGCGCCCTAGCGCCCGCTCCmCGCTTTCTTCCCTTCCTTTCTC 

GCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCC 

CTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAA 

ACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAG 

ACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGG 

ACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATT 

CTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAA 

AATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATAT 

TAACGCTTACAATTTAGGTGGCACTTTTCGGGGAAATGTGCGCGGA 

ACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCT 

CATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGG 

AAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTT 

tgcggcattttgccttcctgtttttgctcaccgagaaacgctggtga 

aagtaaaagatgctgaagatcagttgggtgcacgagtgggttaca 

tcgaactggatctcaacagcggtaagatccttgagagttttcgccc 

cgaagaacgttttccaatgatgagcacttttaaagttctgctatgt 

ggcgcggtattatcccgtattgacgccgggcaagagcaactcggtc 

gccgcatacactattctcagaatgacttggttgagtactcaccagt 

cacagaaaagcatcttacggatggcatgacagtaagagaattatg 

cagtgctgccataaccatgagtgataacactgcggccaacttactt 

ctgacaacgatcggaggaccgaaggagctaaccgcttttttgcaca 

acatgggggatcatg'taactcgccttgatcgttgggaaccggagct 

gaatgaagccataccaaacgacgagcgtgacaccacgatgcctgt 

agcaatggcaacaacgttgcgcaaactattaactggcgaactactt 

actctagcttcccggcaacaattaatagactggatggaggcggata 

aagttgcaggaccacttctgcgctcggcccttccggctggctggtt 
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Table 6 (continued). Nucleotide sequence of plasmid pLenti6/V5-DEST. 

TATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATC 

ATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTA 

TCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGAC 

AGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTC 

AGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATT 

TTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATG 

ACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACC 

CCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGC 

GTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGG 

TTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACT 

GGCTTCAGCAGAGCGCAGATACCAAATACTGTTCTTCTAGTGTAGC 

CGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATA 

CCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGAT 

AAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATA 

AGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCA 

GCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTG 

AGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACA 

GGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGG 

AGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTT 

TCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGG 

GGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTT 

CCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATC 

CCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGAT 

ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGC 

GAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCG 

CGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACT 

GGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCA 

CTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATG 

TTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCT 

ATGACCATGATTACGCCAAGCGCGCAATTAACCCTCACTAAAGGGA 

ACAAAAGCTGGAGCTGCAAGCTT SEQIDNO:2 
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Table 7. Nucleotide sequence of plasmid pLenti6/V5-dTOPO™. 

AATGTAGTCTTATGCAATACTCTTGTAGTCTTGCAACATGGTAACG 

ATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCA 

TGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAA 

GGCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTGC 

CGCATTGCAGAGATATTGTATTTAAGTGCCTAGCTCGATACATAAA 

CGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAG 

TGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAG 

AGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTG 

GCGCCCGAACAGGGACTTGAAAGCGAAAGGGAAACCAGAGGAGCT 

CTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACGGCAAGAGG 

CGAGGGGCGGCGACTGGTGAGTACGCCAAAAATTTTGACTAGCGG 

AGGCTAGAAGGAGAGAGATGGGTGCGAGAGCGTCAGTATTAAGCG 

GGGGAGAATTAGATCGCGATGGGAAAAAATTCGGTTAAGGCCAGG 

GGGAAAGAAAAAATATAAATTAAAACATATAGTATGGGCAAGCAG 

GGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCA 

GAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAG 

ACAGGATCAGAAGAACTTAGATCATTATATAATACAGTAGCAACCC 

TCTATTGTGTGCATCAAAGGATAGAGATAAAAGACACCAAGGAAG 

CTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGACCACCG 

CACAGCAAGCGGCCGCTGATCTTCAGACCTGGAGGAGGAGATATG 

AGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTAGTAAAA 

ATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTG 

GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 

GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATG 

ACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGC 

AGCAGAA.CAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTT 

GCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCT 

GGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTG 

GGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 

GCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATCACACGA 

CCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAA 

TACACTCCTTAATTGAAGAATCGCAAAACCAGCAAGAAAAGAATG 

AACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTGTGGAATTG 

GTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATG 

ATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTACTTTC 

TATAGTGAATAGAGTTAGGCAGGGATATTCACCATTATCGTTTCAG 

ACCCACCTCCCAACCCCGAGGGGACCCGACAGGGCCGAAGGAATA 

GAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATCCATTCGATTA 

GTGAACGGATCTCGACGGTATCGATAAGCTTGGGAGTTCCGCGTTA 



-121- 



Table 7 (continued). Nucleotide sequence of plasmid pLenti6/V5-dTOPO™. 

CATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCC 

CCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 

ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAA 

CTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCC 

CCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCC 

CAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGT 

ATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATC 

AATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCC 

ACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACG 

GGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATG 

GGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTT 

TAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGA 

CCTCCATAGAAGACACCGACTCTAGAGGATCCACTAGTCCAGTGTG 

GTGGAATTGATCCCTTCACCAAGGGCTCGAGTCTAGAGGGCCCGCG 

GTTCGAAGGTAAGCCTATCCCTAACCCTCTCCTCGGTCTCGATTCTA 

CGCGTACCGGTTAGTAATGAGTTTGGAATTAATTCTGTGGAATGTG 

TGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGGCAGGCAG 

AAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGA 

AAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCAT 

CTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCC 

CGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGA 

CTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCTGCCTCTGA 

GCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTT 

TGCAAAAAGCTCCCGGGAGCTTGTATATCCATTTTCGGATCTGATC 

AGCACGTGTTGACAATTAATCATCGGCATAGTATATCGGCATAGTA 

TAATACGACAAGGTGAGGAACTAAACCATGGCCAAGCCTTTGTCTC 

AAGAAGAATCCACCCTCATTGAAAGAGCAACGGCTACAATCAACA 

GCATCCCCATCTCTGAAGACTACAGCGTCGCCAGCGCAGCTCTCTC 

TAGCGACGGCCGCATCTTCACTGGTGTCAATGTATATCATTTTACTG 

GGGGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGCTGCTGCTGC 

GGCAGCTGGCAACCTGACTTGTATCGTCGCGATCGGAAATGAGAAC 

AGGGGCATCTTGAGCCCCTGCGGACGGTGCCGACAGGTGCTTCTCG 

ATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACAGTGATGGAC 

AGCCGACGGCAGTTGGGATTCGTGAATTGCTGCCCTCTGGTTATGT 

GTGGGAGGGCTAAGCACAATTCGAGCTCGGTACCTTTAAGACCAAT 

GACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAG 

GGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATCTG 

CTTTTTGCTTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTG 

G 
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Table 7 (continued). Nucleotide sequence of plasmid pLenti6/V5-dTOPO™. 

GAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAA 

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGAC 

TCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAA 

TCTCTAGCAGfAGTAGTTCATGTCATCTTATTATTCAGTATTTATAA 

CTTGCAAAGAAATGAATATCAGAGAGTGAGAGGAACTTGTTTATTG 

CAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCAC 

AAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAAC 

TCATCAATGTATCTTATCATGTCTGGCTCTAGCTATCCCGCCCCTAA 

CTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCG 

CCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGC 

GGCCTAGGGACGTACCCAATTCGCCCTATAGTGAGTCGTATTACGC 

GCGCTCACTGGCCGTCGTTTTAGAACGTCGTGACTGGGAAAACCCT 

GGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCA 

GCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACA 

GTTGCGCAGCCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGC 

ATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACA 

CTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTT 

CTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGC 

TCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAA 

AAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGAT 

AGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGT 

GGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCT 

ATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTA 

AAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAA 

TATTAACGCTTACAATTTAGGTGGCACTTTTCGGGGAAATGTGCGC 

GGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCC 

GCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAA 

AGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTT 

TTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGG 

TGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTT 

ACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCG 

CCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTA 

TGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCG 

GTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACC 

AGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATT 

ATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA 

CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGC 

ACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGA 

GCTGAATGAAGCCAT 
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Table 7 (continued). Nucleotide sequence of plasmidpLenti6A^5-dTOPO™. 

ACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAAC 

AACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCC 

CGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGA 

CCAOTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAA 

ATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTG 

GGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGG 

GGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGA 

TAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTA 

CTCATATATACTTTAGATTGATTTAAAAOTCATTTTTAATTTAAAA 

GGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCC 

TTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAG 

ATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTG 

CTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCG 

GATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCA 

GAGCGCAGATACCAAATACTGTTCTTCTAGTGTAGCCGTAGTTAGG 

CCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTG 

CTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC 

TTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGC 

GGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGC 

GAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAG 

AAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGG 

TAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAG 

GGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCT 

CTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGC 

CTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCT 

TTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATT 

CTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCG 

CCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGC 

GGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCG 

ATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGG 

GCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGC 

ACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA 

TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGA 

TTACGCCAAGCGCGCAATTAACCCTCACTAAAGGGAACAAAAGCT 

GGAGCTGCAAGCTT SEQIDNO:3 
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Table 8. Nucleotide sequence of pLenti4/V 5-DEST. 

AATGTAGTCTTATGCAATACTCTTGTAGTCTTGCAACATGGTAACG 

ATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCA 

TGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAA 

GGCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTGC 

CGCATTGCAGAGATATTGTATTTAAGTGCCTAGCTCGATACATAAA 

CGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAG 

TGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAG 

AGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTG 

GCGCCCGAACAGGGACTTGAAAGCGAAAGGGAAACCAGAGGAGCT 

CTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACGGCAAGAGG 

CGAGGGGCGGCGACTGGTGAGTACGCCAAAAATTTTGACTAGCGG 

AGGCTAGAAGGAGAGAGATGGGTGCGAGAGCGTCAGTATTAAGCG 

GGGGAGAATTAGATCGCGATGGGAAAAAATTCGGTTAAGGCCAGG 

GGGAAAGAAAAAATATAAATTAAAACATATAGTATGGGCAAGCAG 

GGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCA 

GAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAG 

ACAGGATCAGAAGAACTTAGATCATTATATAATACAGTAGCAACCC v 

TCTATTGTGTGCATCAAAGGATAGAGATAAAAGACACCAAGGAAG 

CTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGACCACCG 

CACAGCAAGCGGCCGCTGATCTTCAGACCTGGAGGAGGAGATATG 

AGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTAGTAAAA 

ATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTG 

GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 

GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATG 

ACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGC 

AGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTT 

GCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCT 

GGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTG 

GGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 

GCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATCACACGA 

CCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAA 

TACACTCCTTAATTGAAGAATCGCAAAACCAGCAAGAAAAGAATG 

AACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTGTGGAATTG 

GTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATG 

ATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTACTTTC 

TATAGTGAATAGAGTTAGGCAGGGATATTCACCATTATCGTTTCAG 

ACCCACCTCCCAACCCCGAGGGGACCCGACAGGCCCGAAGGAATA 

GAAGAAGAAGGTGGAGAGAGA 
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Table 8 (continued). Nucleotide sequence of pLenti4/V5-DEST. 

GACAGAGACAGATCCATTCGATTAGTGAACGGATCTCGACGGTATC 

GATAAGCTTGGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCC 

GCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATG 

ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTC 

AATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCA 

AGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGT 

AAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACT 

TTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATG 

GTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTG 

ACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAG 

TTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAAC 

AACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGG 

GAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCT 

GGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGACT 

CTAGAGGATCCACTAGTCCAGTGTGGTGGAATTCTGCAGATATCAA 

CAAGTTTGTACAAAAAAGCTGAACGAGAAACGTAAAATGATATAA 

ATATCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATA 

ATACTGTAAAACACAACATATCCAGTCACTATGGCGGCCGCATTAG 

GCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGG 

ATTTTGAGTTAGGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCTA 

AAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCCA 

ATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAA 

TGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAA 

AGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCA 

CATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTATGGCA 

ATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTT 

ACACCGTTTrCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAG 

TGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAA 

GATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGT 

TTATTGAGAATATGTTTTTCGTCTCAGCCAATCCCTGpGTGAGTTTC 

ACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCC 

CCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCT 

GATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTC 

CATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGT 

GGCAGGGCGGGGCGTAAAGATCTGGATCCGGCTTACTAAAAGCCA 

GATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAAT 

ATATACTGATATGTATACCCGAAG 
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Table 8 (continued). Nucleotide sequence of pLenti4/V5-DEST. 

TATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACA 

GTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAA 

TATCTCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCCCGTCGT 

CTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGGGATGGCT 

GAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTGACGAGA 

ACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGA 

GAGAGCCGTTATCGTCTGTTTGTGGATGTACAGAGTGATATTATTG 

ACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGCACGTCT 

GCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTGCATATC 

GGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTG 

CCGGTCTCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCG 

AAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGGAATATA 

AATGTCAGGCTCCGTTATACACAGCCAGTCTGCAGGTCGACCATAG 

TGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTAT 

GCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCT 

CGTTCAGCTTTCTTGTACAAAGTGGTTGATATCCAGCACAGTGGCG 

GCCGCTCGAGTCTAGAGGGCCCGCGGTTCGAAGGTAAGCCTATCCC 

TAACCCTCTCCTCGGTCTCGATTCTACGCGTACCGGTTAGTAATGAG 

TTTGGAATTAATTCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAG 

TCCCCAGGCTCCCCAGGCAGGCAGAAGTATGCAAAGCATGCATCTC 

AATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCA 

GGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATA 

GTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTC 

CGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAG 

AGGCCGAGGCCGCCTCTGCCTCTGAGCTATTCCAGAAGTAGTGAGG 

AGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCCCCCTGTTG 

ACAATTAATCATCGGCATAGTATATCGGCATAGTATAATACGACAA 

GGTGAGGAACTAAACCATGGCCAAGTTGACCAGTGCCGTTCCGGTG 

CTCACCGCGCGCGACGTCGCCGGAGCGGTCGAGTTCTGGACCGACC 

GGCTCGGGTTCTCCCGGGACTTCGTGGAGGACGACTTCGCCGGTGT 

GGTCCGGGACGACGTGACCCTGTTCATCAGCGCGGTCCAGGACCAG 

GTGGTGCCGGACAACACCCTGGCCTGGGTGTGGGTGCGCGGCCTGG 

ACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCG 

GGACGCCTCCGGGCCGGCCATGACCGAGATCGGCGAGCAGCCGTG 

GGGGCGGGAGTTCGCCCTGCGCGACCCGGCCGGCAACTGCGTGCA 

CTTCGTGGCCGAGGAGCAGGACTGACACGTGCTACGAGATTTAAAT 

GGTACCTTTAAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGC 

CACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCC 

AACGAAGACAAGATCTGCTTTTTGCTTGTACTGGGTCTCTCTGGTTA 

GACCAGATCTGAGCCTGGGAGCTCTCTG 
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Table 8 (continued). Nucleotide sequence of pLenti4/V5-DEST. 

GCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTG 

AGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACT 

AGAGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAG 

TAGTAGTTCATGTCATCTTATTATTCAGTATTTATAACTTGCAAAGA 

AATGAATATCAGAGAGTGAGAGGAACTTGTTTATTGCAGCTTATAA 

TGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGC 

ATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATG 

TATCTTATCATGTCTGGCTCTAGCTATCCCGCCCCTAACTCCGCCCA 

TCCCGCCCCTAACTCCGCCCAGTTCCGGCCATTCTCCGCCCCATGGC 

TGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTC 

TGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGG 

ACGTACCCAATTCGCCCTATAGTGAGTCGTATTACGCGCGCTCACT 

GGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACC 

CAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTA 

ATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAG 

CCTGAATGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGC 

GGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGC 

GCCCTAGCGCCCGCrCCTTTCGCTTTCTtCCCTTCCTTTCTCGCCACG 

TTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGG 

GTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGAT 

TAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTT 

TTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTG 

TTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGA 

TTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAG 

CTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGC 

TTACAATTTAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT 

ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAG 

ACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAG 

TATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGG 

CATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTA 

AAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAA 

CTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAG 

AACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGC 

GGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGC 

ATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAG 

AAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTG 

CTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGAC 

AACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATG 

GGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATG 

AAGCCATACCAAACGAC 
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Table 8 (continued). Nucleotide sequence of pLenti4/V5-DEST. 

GAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGC 

AAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAAT 

TAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGC 

GCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGC 

CGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT 

GGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGG 

CAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCT 

CACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATAT 

ACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGG 

TGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGA 

GTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGA 

TCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAAC 

AAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAG 

CTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGA 

TACCAAATACTGTTCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTC 

AAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTT 

ACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 

GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGA 

ACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTAC 

ACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACG 

CTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 

GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCC 

TGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCG 

TCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAAC 

GCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTT 

TGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACC 

GTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAAC 

GACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCC 

AATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGC 

AGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGGGC 

AACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTT 

TACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGA 

TAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGC 

GCGCAATTAACCCTCACTAAAGGGAACAAAAGCTGGAGCTGCAAG 

CTT SEQIDNO:4 
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Table 9. Nucleotide sequence of pLenti6/UbC/V5-DEST. 

AATGTAGTCTTATGCAATACTGTTGTAGTCTTGCAACATGGTAACG 

ATGAGTTAGCAACATGCCTTACAA.GGAGAGAAAAAGCACCGTGCA 

TGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAA 

GGCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTGC 

CGCATTGCAGAGATATTGTATTTAAGTGCCTAGCTCGATACATAAA 

CGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

TAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAG 

TGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAG 

AGATCCCTCAGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTG 

GCGCCCGAACAGGGACTTGAAAGCGAAAGGGAAACCAGAGGAGCT 

CTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACGGCAAGAGG 

CGAGGGGCGGCGACTGGTGAGTACGCCAAAAATTTTGACTAGCGG 

AGGCTAGAAGGAGAGAGATGGGTGCGAGAGCGTCAGTATTAAGCG 

GGGGAGAATTAGATCGCGATGGGAAAAAATTCGGTTAAGGCCAGG 

GGGAAAGAAAAAATATAAATTAAAACATATAGTATGGGCAAGCAG 

GGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCA 

GAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAG 

ACAGGATCAGAAGAACTTAGATCATTATATAATACAGTAGCAACCC 

TCTATTGTGTGCATCAAAGGATAGAGATAAAAGACACCAAGGAAG 

CTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGACCACCG 

CACAGCAAGCGGCCGCTGATCTTCAGACCTGGAGGAGGAGATATG 

AGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTAGTAAAA 

ATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTG 

GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTT 

GGGTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATG 

ACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGC 

AGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTT 

GCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCT 

GGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTG 

GGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAAT 

GCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATCACACGA 

CCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAA 

TACACTCCTTAATTGAAGAATCGCAAAACCAGCAAGAAAAGAATG 

AACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTGTGGAATTG 

GTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATG 

ATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGCTGTACTTTC 

TATAGTGAATAGAGTTAGGCAGGGATATTCACCATTATCGTTTCAG 

ACCCACCTGCCAACCCCGAGGGGACCCGACAGGCCCGAAGGAATA 

GAAGAAGAAGGTGGAGAGAGA 
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Table 9 (continued). Nucleotide sequence of pLenti6/UbC/V5-DEST. 

GACAGAGACAGATCCATTCGATTAGTGAACGGATCTCGACGGTATC 

GGATCTGGCCTCCGCGCCGGGTTTTGGCGCCTCCCGCGGGCGCCCC 

CCTCCTCACGGCGAGCGCTGCCACGTCAGACGAAGGGCGCAGGAG 

CGTCCTGATCCTTCCGCCCGGACGCTCAGGACAGCGGCCCGCTGCT 

CATAAGACTCGGCCTTAGAACCCCAGTATCAGCAGAAGGACATTTT 

AGGACGGGACTTGGGTGACTCTAGGGCACTGGTTTTCTTTCCAGAG 

AGCGGAACAGGCGAGGAAAAGTAGTCCCTTCTCGGCGATTCTGCG 

GAGGGATCTCCGTGGGGCGGTGAACGCCGATGATTATATAAGGAC 

GCGCCGGGTGTGGCACAGCTAGTTCCGTCGCAGCCGGGATTTGGGT 

CGCGGTTCTTGTTTGTGGATCGCTGTGATCGTCACTTGGTGAGTAGC 

GGGCTGCTGGGCTGGCCGGGGCTTTCGTGGCCGCCGGGCCGCTCGG 

TGGGACGGAAGCGTGTGGAGAGACCGCCAAGGGCTGTAGTCTGGG 

TCCGCGAGCAAGGTTGCCCTGAACTGGGGGTTGGGGGGAGCGCAG 

CAAAATGGCGGCTGTTCCCGAGTCTTGAATGGAAGACGCTTGTGAG 

GCGGGCTGTGAGGTCGTTGAAACAAGGTGGGGGGCATGGTGGGCG 

GCAAGAACCCAAGGTCTTGAGGCCTTCGCTAATGCGGGAAAGCTCT 

TATTCGGGTGAGATGGGCTGGGGCACCATCTGGGGACCCTGACGTG 

AAGTTTGTCACTGACTGGAGAACTCGGTTTGTCGTCTGTTGCGGGG 

GCGGCAGTTATGCGGTGCCGTTGGGCAGTGCACCCGTACCTTTGGG 

AGCGCGCGCCCTCGTCGTGTCGTGACGTCACCCGTTCTGTTGGCTTA 

TAATGCAGGGTGGGGCCACCTGCCGGTAGGTGTGCGGTAGGCTTTT 

CTCCGTCGCAGGACGCAGGGTTCGGGCCTAGGGTAGGCTCTCCTGA 

ATCGACAGGCGCCGGACCTCTGGTGAGGGGAGGGATAAGTGAGGC 

GTCAGTTTCTTTGGTCGGTTTTATGTACCTATCTTCTTAAGTAGCTG 

AAGCTCCGGTTTTGAACTATGCGCTCGGGGTTGGCGAGTGTGTTTT 

GTGAAGTTTTTTAGGCACCTTTTGAAA.TGTAATCATTTGGGTCAATA 

TGTAATTTTCAGTGTTAGACTAGTAAATTGTCCGCTAAATTCTGGCC 

GTTTTTGGCTTTTTTGTTAGACGAAGCTTGGTACCGAGCTCGGATCC 

ACTAGTCCAGTGTGGTGGAATTCTGCAGATATCAACAAGTTTGTAC 

AAAAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATA 

TTAAATTAGATTTTGCATAAAAAACAGACTACATAATACTGTAAAA 

CACAACATATCCAGTCACTATGGCGGCCGCATTAGGCACCCCAGGC 

TTTACACTTTATGGTTCCGGCTCGTATAA.TGTGTGGATTTTGAGTTA 

GGATCCGGCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAA 

AAAAATCACTGGATATACCACCGTTGATATATCCCAATGGCATCGT 

AAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATA 

ACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAA. 

GAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCC 

GCC 
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Table 9 (continued). Nucleotide sequence of pLenti6/UbC/V5-DEST. 

TGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGA 

GCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATG 

AGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGA 

TTTCCGGCAGTTTCTACACATATATTCGCAAGATGJ'GGCGTGTTACG 

GTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTT 

TTTCGTCTCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAA 

ACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGG 

CAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATT 

CAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGC 

TTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGT 

AAAGATCTGGATCCGGCTTACTAAAAGCCAGATAACAGTATGCGTA 

TTTGCGCGCTGATTTTTGCGGTATAAGAATATATACTGATATGTATA 

CCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTAC 

AGTGACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATG 

ATGTCAATATCTCCGGTCTGGTAAGCACAACCATGCAGAATGAAGC 

CCGTCGTCTGCGTGCCGAACGCTGGAAAGCGGAAAATCAGGAAGG 

GATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCT 

GACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTAT 

AAAAGAGAGAGCCGTTATCGTCTGTTTGTGGATGTACAGAGTGATA 

TTATTGACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCAGTGC 

ACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTGGTG 

CATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCC 

AGTGTGCCGGTCTCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCC 

ACCGCGAAAATGACATCAAAAACGCCATTAACCTGATGTTCTGGGG 

AATATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCAGGTCGA 

CCATAGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTT 

TTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTAC 

GTTTCTCGTTCAGCTTTCTTGTACAAAGTGGTTGATATCCAGCACAG 

TGGCGGCCGCTCGAGTCTAGAGGGCCCGCGGTTCGAAGGTAAGCCT 

ATCCCTAACCCTCTCCTCGGTCTCGATTCTACGCGTACCGGTTAGTA 

ATGAGTTTGGAATTAATTCTGTGGAATGTGTGTCAGTTAGGGTGTG 

GAAAGTCCCCAGGCTCCCCAGGCAGGCAGAAGTATGCAAAGCATG 

CATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCC 

CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAA 

CCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCC 

AGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTA 

TGCAGAGGCCGAGGCCGCCT 



-132- 



Table 9 (continued). Nucleotide sequence of pLenti6/UbC/V5-DEST. 

CTGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGG 

CCTAGGCTTTTGCAAAAAGCTCCCGGGAGCTTGTATATCCATTTTCG 

GATCTGATCAGCACGTGTTGACAATTAATCATCGGCATAGTATATC 

GGCATAGTATAATACGACAAGGTGAGGAACTAAACCATGGCCAAG 

CCTTTGTCTCAAGAAGAATCCACCCTCATTGAAAGAGCAACGGCTA 

CAATCAACAGCATCCCCATCTCTGAAGACTACAGCGTCGCCAGCGC 

AGCTCTCTCTAGCGACGGCCGCATCTTCACTGGTGTCAATGTATATC 

ATTTTACTGGGGGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGC 

TGCTGCTGCGGCAGCTGGCAACCTGACTTGTATCGTCGCGATCGGA 

AATGAGAACAGGGGCATCTTGAGCCCCTGCGGACGGTGCCGACAG 

GTGCTTCTCGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGACA 

GTGATGGACAGCCGACGGCAGTTGGGATTCGTGAATTGCTGCCCTC 

TGGTTATGTGTGGGAGGGCTAAGCACAATTCGAGCTCGGTACCTTT 

AAGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTA 

AAAGAAAAGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGA 

CAAGATCTGCTTTTTGCTTGTACTGGGTCTCTCTGGTTAGACCAGAT 

CTGAGCCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAG 

CCTCAATAAAGCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCT 

GTTGTGTGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCA 

GTGTGGAAAATCTCTAGCAGTAGTAGTTCATGTCATCTTATTATTCA 

GTATTTATAACTTGCAAAGAAATGAATATCAGAGAGTGAGAGGAA 

CTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATC 

TTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGCTCTAGCTAT 

CCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCG 

CCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAG 

GCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAG 

GCTTTTTTGGAGGCCTAGGGACGTACCCAATTCGCCCTATAGTGAG 

TCGTATTACGCGCGCTCACTGGCCGTCGTTTTACAACGTCGTGACTG 

GGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCC 

CCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCC 

CTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGCGCCCTG 

TAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGT 

GACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCT 

TCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCA 
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Table 9 (continued). Nucleotide sequence of pLenti6/UbC/V5-DEST. 

AGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTAC 

GGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAG 

TGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGT 

CCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACT 

CAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGA 

TTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAA 

C GCGAATTTTAACAAAATATTAACGCTTAC AATTTAGGTGGCACTT 

TTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATA 

CATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGC 

TTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCG 

TGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTT 

TCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTT 

GGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAA 

GATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGC 

ACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGC 

CGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGAC 

TTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCA 

TGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAA 

CACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGA 

GCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTT 

GATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 

CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAA 

CTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAA 

TAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTC 

GGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGT 

GAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTA 

AGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAAC 

TATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACT 

GATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTT 

AGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAA 

GATCCTTTTTGATAATCTCATGACCAAAATCCCTTAA.CGTGAGTTTT 

CGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTC 

TTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAA 

AACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACC 

AACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCA 

AATACTGTTCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGA 

ACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCA 

GTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACT 

CAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGG 

GGGGTTCGTGCACACAGCCCAG 
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Table 9 (continued). Nucleotide sequence of pLenti6/UbC/V5-DEST. 

CTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGA 

GCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAG 

GTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGA 

GCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTT 

CGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGG 

GGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTT 

CCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTITTCCTGCGTTATC 

CCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGAT 

ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGC 

GAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCG 

CGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACT 

GGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCA 

CTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATG 

TTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCT 

ATGACCATGATTACGCCAAGCGCGCAATTAACCCTCACTAAAGGGA 

ACAAAAGCTGGAGCTGCAAGCTT SEQIDNO:5 
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Table 10. Nucleotide sequence of plasmidpLPl. 

TTGGCCCATTGCATACGTTGTATCCATATCATAATATGTACATTTAT 

ATTGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTATTGAC 

TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCA 

TATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 

GCTGACCGCCCAACGACCCCGGCCCATTGACGTCAATAATGACGTA 

TGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGG 

GTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT 

ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATG 

GCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCT 

ACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGAT 

GCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCA 

CGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGT 

TTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTC 

CGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGT 

CTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 

CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGAT 

CCAGCCTCCCCTCGAAGCTTACATGTGGTACCGAGCTCGGATCCTG 

AGAACITCAGGGTGAGTCTATGGGACCCTTGATGTTTTCTTTCCCCT 

TCTTTTCTATGGTTAAGTTCATGTCATAGGAAGGGGAGAAGTAACA 

GGGTACACATATTGACCAAATCAGGGTAATTTTGCATTTGTAATTTT 

AAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCT 

AATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGT 

ATCATGCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTG 

GGTTAAGGCAATAGCAATATTTCTGCATATAAATATTTCTGCATAT 

AAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAGCTAC 

AATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGG 

ATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACC 

TCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGC 

TGGCCCATCACTTTGGCAAAGCACGTGAGATCTGAATTCGAGATCT 

GCCGCCGCCATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAA 

TTAGATCGATGGGAAAAAATTCGGTTAAGGCCAGGGGGAAAGAAA 

AAATATAAATTAAAACATATAGTATGGGCAAGCAGGGAGCTAGAA 

CGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCAGAAGGCTGTA 

GACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAG 

AAGAACTTAGATCATTATATAATACAGTAGCAACCCTCTATTGTGT 

GCATCAAAGGATAGAGATAAAAGACACCAAGGAAGCTTTAGACAA 

GATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAAGCACAGCAAG 

CAGCAGCTGACACAGGACACAGCAATCAGGTCAGCCAAAATTACC 

CTATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATC 

ACCTAGAACTTTAAATGCATGGG 
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Table 10 (continued). Nucleotide sequence of plasmid pLPl 

TAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTGATACCCA 

TGTTTTCAGCATTATCAGAAGGAGCCACCCCACAAGATTTAAACAC 

CATGCTAAACACAGTGGGGGGACATCAAGCAGCCATGCAAATGTT 

AAAAGAGACCATCAATGAGGAAGCTGCAGAATGGGATAGAGTGCA 

TCCAGTGCATGCAGGGCCTATTGCACCAGGCCAGATGAGAGAACC 

AAGGGGAAGTGACATAGCAGGAACTACTAGTACCCTTCAGGAACA 

AATAGGATGGATGACACATAATCCACCTATCCCAGTAGGAGAAAT 

CTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGAAT 

GTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGA 

ACCCTTTAGAGACTATGTAGACCGATTCTATAAAACTCTAAGAGCC 

GAGCAAGCTTCACAAGAGGTAAAAAATTGGATGACAGAAACCTTG 

TTGGTCCAAAA.TGCGAACCCAGATTGTAAGACTATTTTAAAAGCAT 

TGGGACCAGGAGCGACACTAGAAGAAATGATGACAGCATGTCAGG 

GAGTGGGGGGACCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCAA 

TGAGCCAAGTAACAAATCCAGCTACCATAATGATACAGAAAGGCA 

ATTTTAGGAACCAAAGAAAGACTGTTAAGTGTTTCAATTGTGGCAA 

AGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAAAGGG 

CTGTTGGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTAC 

TGAGAGACAGGCTAATTTTTTAGGGAAGATCTGGCCTTCCCACAAG 

GGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGAGCCAACAGCC 

CCACCAGAAGAGAGCTTCAGGTTTGGGGAAGAGACAACAACTCCC 

TCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAGCTT 

CCCTCAGATCACTCTTTGGCAGCGACCCCTCGTCACAATAAAGATA 

GGGGGGCAATTAAAGGAAGCTCTATTAGATACAGGAGCAGATGAT 

ACAGTATTAGAAGAAATGAATTTGCCAGGAAGATGGAAACCAAAA 

ATGATAGGGGGAATTGGAGGTTTTATCAAAGTAA.GACAGTATGATC 

AGATACTCATAGAAATCTGCGGACATAAAGCTATAGGTACAGTATT 

AGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACT 

CAGATTGGCTGCACTTTAAATTTTCCCATTAGTCCTATTGAGACTGT 

ACCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAACA 

ATGGCCATTGACAGAAGAAAAAATAAAAGCATTAGTAGAAATTTG 

TACAGAAATGGAAAAGGAAGGAAAAATTTCAAAAATTGGGCCTGA 

AAATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGACAGT 

ACTAAATGGAGAAAATTAGTAGATTTCAGAGAACTTAATAAGAGA 

ACTCAAGATTTCTGGGAAGTTCAATTAGGAATACCACATCCTGCAG 

GGTTAAAACAGAAAAAATCAGTAACAGTACTGGATGTGGGCGATG 

CATATTTTTCAGTTCCCTTAGATAAAGACTTCAGGAAGTATACTGC 

ATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATAT 

CAGTACAATGTGCTTCCACAGGGA 
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Table 10 (continued). Nucleotide sequence of plasmid pLPl 

TGGAAAGGATCACCAGCAATATTCCAGTGTAGCATGACAAAAATCT 

TAGAGCCTTTTAGAAAACAAAATCCAGACATAGTCATCTATCAATA 

CATGGATGATTTGTATGTAGGATCTGACTTAGAAATAGGGCAGCAT 

AGAACAAAAATAGAGGAACTGAGACAACATCTGTTGAGGTGGGGA 

TTTACCACACCAGACAAAAAACATCAGAAAGAACCTCCATTCCTTT 

GGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTAT 

AGTGCTGCCAGAAAAGGACAGCTGGACTGTCAATGACATACAGAA 

ATTAGTGGGAAAATTGAATTGGGCAAGTCAGATTTATGCAGGGATT 

AAAGTAAGGCAATTATGTAAACTTCTTAGGGGAACCAAAGCACTA 

ACAGAAGTAGTACCACTAACAGAAGAAGCAGAGCTAGAACTGGCA 

GAAAACAGGGAGATTCTAAAAGAACCGGTACATGGAGTGTATTAT 

GACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGGGCAA 

GGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATCTGA 

AAACAGGAAAGTATGCAAGAATGAAGGGTGCCCACACTAATGATG 

TGAAACAATTAACAGAGGCAGTACAAAAAATAGCCACAGAAAGCA 

TAGTAATATGGGGAAAGACTCCTAAATTTAAATTACCCATACAAAA 

GGAAACATGGGAAGCATGGTGGACAGAGTATTGGCAAGCCACCTG 

GATTCCTGAGTGGGAGTTTGTCAATACCCCTCCCTTAGTGAAGTTAT 

GGTACCAGTTAGAGAAAGAACCCATAATAGGAGCAGAAACTTTCT 

ATGTAGATGGGGCAGCCAATAGGGAAACTAAATTAGGAAAAGCAG 

GATATGTAACTGACAGAGGAAGACAAAAAGTTGTCCCCCTAACGG 

ACACAACAAATCAGAAGACTGAGTTACAAGCAATTCATCTAGCTTT 

GCAGGATTCGGGATTAGAAGTAAACATAGTGACAGACTCACAATA 

TGCATTGGGAATCATTCAAGCACAACCAGATAAGAGTGAATCAGA 

GTTAGTCAGTCAAATAATAGAGCAGTTAATAAAAAAGGAAAAAGT 

CTACCTGGCATGGGTACCAGCACACAAAGGAATTGGAGGAAATGA 

ACAAGTAGATAAATTGGTCAGTGCTGGAATCAGGAAAGTACTATTT 

TTAGATGGAATAGATAAGGCCCAAGAAGAACATGAGAAATATCAC 

AGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTACCACCTGTAG 

TAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAGCTAAAAG 

GGGAAGCCATGCATGGACAAGTAGACTGTAGCCCAGGAATATGGC 

AGCTAGATTGTACACATTTAGAAGGAAAAGTTATCTTGGTAGCAGT 

TCATGTAGCCAGTGGATATATAGAAGCAGAAGTAATTCCAGCAGA 

GACAGGGCAAGAAACAGCATACTTCCTCTTAAAATTAGCAGGAAG 

ATGGCCAGTAAAAACAGTACATACAGACAATGGCAGCAATTTCAC 

CAGTACTACAGTTAAGGCCGCCTGTTGGTGGGCGGGGATCAAGCA 

GGAATTTGGCATTCCCTACAATCCCCAAAGTCAAGGAGTAATAGAA 
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Table 10 (continued). Nucleotide sequence of plasmid pLPl 

TCTATGAATAAAGAATTAAAGAAAATTATAGGACAGGTAAGAGAT 

CAGGCTGAACATCTTAAGACAGCAGTACAAATGGCAGTATTCATCC 

ACAATTTTAAAAGAAAAGGGGGGATTGGGGGGTACAGTGCAGGGG 

AAAGAATAGTAGACATAATAGCAACAGACATACAAACTAAAGAAT 

TACAAAAACAAATTACAAAAATTCAAAATTTTCGGGTTTATTACAG 

GGACAGCAGAGATCCAGTTTGGAAAGGACCAGCAAAGCTCCTCTG 

GAAAGGTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAA 

AGTAGTGCCAAGAAGAAAAGCAAAGATCATCAGGGATTATGGAAA 

ACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGA 

TTAACACATGGAATTCCGGAGCGGCCGCAGGAGCTTTGTTCCTTGG 

GTTCTTGGGAGCAGCAGGAAGCACTATGGGCGCAGCGTCAATGAC 

GCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAG 

CAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGC 

AACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAATCCTGG 

CTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGG 

GTTGCTCTGGAAAACTCATTTGCACCACTGCTGTGCCTTGGAATGCT 

AGTTGGAGTAATAAATCTCTGGAACAGATTTGGAATCACACGACCT 

GGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTCCGCG 

GAATTCACCCCACCAGTGCAGGCTGCCTATCAGAAAGTGGTGGCTG 

GTGTGGCTAATGCCCTGGCCCACAAGTATCACTAAGCTCGCTTTCTT 

GCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCCAACTA 

CTAAACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGC 

CTAATAAAAAACATTTATTTTCATTGCAATGATGTATTTAAATTATT 

TCTGAATATTTTACTAAAAAGGGAATGTGGGAGGTCAGTGCATTTA 

AAACATAAAGAAATGAAGAGCTAGTTCAAACCTTGGGAAAATACA 

CTATATCTTAAACTCCATGAAAGAAGGTGAGGCTGCAAACAGCTAA 

TGCACATTGGCAACAGCCCCTGATGCCTATGCCTTATTCATCCCTCA 

GAAAAGGATTCAAGTAGAGGCTTGATTTGGAGGTTAAAGTTTTGCT 

ATGCTGTATTTTACATTACTTATTGTTTTAGCTGTCCTCATGAATGT 

CTTTTCACTACCCATTTGCTTATCCTGCATCTCTCAGCCTTGACTCC 

ACTCAGTTCTCTTGCTTAGAGATACCACCTTTCCCCTGAAGTGTTCC 

TTCCATGTTTTACGGCGAGATGGTTTCTCCTCGCCTGGCCACTCAGC 

CTTAGTTGTCTCTGTTGTCTTATAGAGGTCTACTTGAAGAAGGAAA 

AACAGGGGGCATGGTTTGACTGTCCTGTGAGCCCTTCTTCCCTGCCT 

CCCCCACTCACAGTGACCCGGAATCCCTCGACATGGCAGTCTAGCA 

CTAGTGCGGCCGCAGATCTGCTTCCTCGCTCACTGACTCGCTGCGCT 

CGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGT 

AATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACAT 

GTGAGCAAAAGGCCAGCAAAAGGCCAGGAACGGTAAAAAGGCCG 

CGTTGCTGGCGTTTTTCCATAGGCTCCGCC 
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Table 10 (continued). Nucleotide sequence of plasmid pLPl 

CCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGC 

GAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAA 

GCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATAC 

CTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTC 

ACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG 

GGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTAT 

CCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATC 

GCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTA 

TGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGC 

TACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAG 

TTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAAC 

CACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACG 

CGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGG 

GGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGT 

CATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAA 

AAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGT 

CTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGAT 

CTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGA 

TAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAAT 

GATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATA 

AACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACT 

TTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGT 

AAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCT 

ACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCA 

GCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTT 

GTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGA 

AGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGC 

ATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACT 

GGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGAC 

CGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACA 

TAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGG 

CGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGT 

AACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACC 

AGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAA 

AAGGGAATAAGGGCGAGACGGAAATGTTGAATACTCATACTCTTCC 

TTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGC 

GGATACATATTTGAA.TGTATTTAGAAAAATAAACAAATAGGGGTTC 

CGCGCACATTTCCCCGAAAAGTGCCACCTGACGGGATCCCCTGAGG 

GGGCCCCCATGGGCTAGAGGATCCGGCCTCGGCCTCTGCATAAATA 

AAAAAAATTAGTCAGCCATGAGC SEQIDNO:6 



I 
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Table 11. Nucleotide sequence ofplasmidpLP2. 

AATGTAGTCTTATGCAATACTCTTGTAGTCTTGCAACATGGTAACG 

ATGAGTTAGCAACATGCCTTACAAGGAGAGAAAAAGCACCGTGCA 

TGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAA 

GGCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTCC 

GCATTGCAGAGATATTGTATTTAAGTGCCTAGCTCGATACAATAAA 

CGCCATTTGACCATTCACCACATTGGTGTGCACCTCCAAGCTCGAG 

CTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTG 

TTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCCCTCG 

AAGCTAGTCGATTAGGCATCTCCTATGGCAGGAAGAAGCGGAGAC 

AGCGACGAAGACCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCT 

ATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCC 

GAAGGAATAGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATC 

CATTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTG 

CGGAGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTT 

GATTGTAACGAGGATTGTGGAACTTCTGGGACGCAGGGGGTGGGA 

AGCCCTCAAATATTGGTGGAATCTCCTACAATATTGGAGTCAGGAG 

CTAAAGAATAGTGCTGTTAGCTTGCTCAATGCCACAGCTATAGCAG 

TAGCTGAGGGGACAGATAGGGTTATAGAAGTAGTACAAGAAGCTT 

GGCACTGGCCGTCGTTTTACAACGTCGTGATCTGAGCCTGGGAGAT 

CTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTG 

CCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGG 

TAACTAGAGATCAGGAAAACCCTGGCGTTACCCAACTTAATCGCCT 

TGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCC 

CGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAAT 

GGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCA 

CACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGC 

ATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACA 

CTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTT 

CTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGC 

TCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAA 

AAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGAT 

AGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGT 

GGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGGCT 

ATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTA 

AAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAA 

TATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCT 

GATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTG 

ACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACA 

A 
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Table 11 (continued). Nucleotide sequence of plasmidpLP2. 

GCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGT 

CATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTAT 

TTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGT 

GGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTT 

CTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGA 

TAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAAC 

ATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTG 

TTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGA 

TCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGC 

GGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGA 

TGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATT 

GACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGA 

ATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGA 

TGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGT 

GATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGA 

AGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCG 

CCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGA 

CGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCG 

CAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAA 

TTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGC 

GCTCGGCCCrTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGC 

CGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT 

GGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGG 

CAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCT 

CACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATAT 

ACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGG 

TGAAGATCCITrTTGATAATCTCATGACCAAAATCCCTTAACGTGA 

GTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGA 

TCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAAC 

AAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAG 

CTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGA 

TACCAAATACTGTTCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTC 

AAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTT 

ACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTGTTACCGGGTTG 

GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGA 

ACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTAC 

ACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACG 

CTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 

G 
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Table 1 1 (continued). Nucleotide sequence of plasmid pLP2. 

TCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCT 

GGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGT 

CGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACG 

CCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTT 

GCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCG 

TATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACG 

ACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCA 

ATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCA 

GCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCA 

ACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTT 

ACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGAT 

AACAATTTCACACAGGAAACAGCTATGACATGATTACGAATTCGAT 

GTACGGGCCAGATATACGCGTATCTGAGGGGACTAGGGTGTGTTTA 

GGCGAAAAGCGGGGCTTCGGTTGTACGCGGTTAGGAGTCCCCTCAG 

GATATAGTAGTTTCGCTTTTGCATAGGGAGGGGGA SEQ ID NO:7 
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Table 12. Nucleotide sequence of plasmid pLP/VSVG. 

TTGGCCCATTGCATACGTTGTATCCATATCATAATATGTACATTTAT 

ATTGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTATTGAC 

TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCA 

TATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 

GCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTA 

TGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGG 

GTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT 

ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATG 

GCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCT 

ACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGAT 

GCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCA 

CGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGT 

TTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTC 

CGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGT 

CTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGA 

CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGAT 

CCAGCCTCCCCTCGAAGCTTACATGTGGTACCGAGCTCGGATCCTG 

AGAACTTCAGGGTGAGTCTATGGGACCCTTGATGTTTTCTTTCCCCT 

TCTTTTCTATGGTTAAGTTCATGTCATAGGAAGGGGAGAAGTAACA 

GGGTACACATATTGACCAAATCAGGGTAATTTTGCATTTGTAATTTT 

AAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTC^ 

AATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGT 

ATCATGCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTG 

GGTTAAGGCAATAGCAATATTTCTGCATATAAATATTTCTGCATAT 

AAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAGCTAC 

AATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGG 

ATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACC 

TCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGC 

TGGCCCATCACTTTGGCAAAGCACGTGAGATCTGAATTCTGACACT 

ATGAAGTGCCTTTTGTACTTAGCCTTTTTATTCATTGGGGTGAATTG 

CAAGTTCACCATAGTTTTTCCACACAACCAAAAAGGAAACTGGAAA 

AATGTTCCTTCTAATTACCATTATTGCCCGTCAAGCTCAGATTTAAA 

TTGGCATAATGACTTAATAGGCACAGCCTTACAAGTCAAAATGCCC 

AAGAGTCACAAGGCTATTCAAGCAGACGGTTGGATGTGTCATGCTT 

CCAAATGGGTCACTACTTGTGATTTCCGCTGGTATGGACCGAAGTA 

TATAACACATTCCATCCGATCCTTCACTCCATCTGTAGAACAATGC 

AAGGAAAGCATTGAACAAACGAAACAAGGAACTTGGCTGAATCCA 

GGCTTCCCTCCTCAAAGTTGTGGATATGCAACTGTGACGGATGCCG 
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Table 12 (continued). Nucleotide sequence of plasmid pLP/VSVG. 

AAGCAGTGATTGTCCAGGTGACTCCTCACCATGTGCTGGTTGATGA 

ATACACAGGAGAATGGGTTGATTCACAGTTCATCAACGGAAAATG 

CAGCAATTACATATGCCCCACTGTCCATAACTCTACAACCTGGCAT 

TCTGACTATAAGGTCAAAGGGCTATGTGATTCTAACCTCATTTCCAT 

GGACATCACCTTCTTCTCAGAGGACGGAGAGCTATCATCCGTGGGA 

AAGGAGGGCACAGGGTTCAGAAGTAACTACTTTGCTTATGAAACTG 

GAGGCAAGGCCTGCAAAATGCAATACTGCAAGCATTGGGGAGTCA 

GACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGATCTCTT 

TGCTGCAGCCAGATTCCCTGAATGCCCAGAAGGGTCAAGTATCTCT 

GCTCCATCTCAGACCTCAGTGGATGTAAGTCTAATTCAGGACGTTG 

AGAGGATCTTGGATTATTCCCTCTGCCAAGAAACCTGGAGCAAAAT 

CAGAGCGGGTCTTCCAATCTCTCCAGTGGATCTCAGCTATCTTGCTC 

CTAAAAACCCAGGAACCGGTCCTGCTTTCACCATAATCAATGGTAC 

CCTAAAATACTTTGAGACCAGATACATCAGAGTCGATATTGCTGCT 

CCAATCCTCTCAAGAATGGTCGGAATGATCAGTGGAACTACCACAG 

AAAGGGAA.CTGTGGGATGACTGGGCACCATATGAAGACGTGGAAA 

TTGGACCCAATGGAGTTCTGAGGACCAGTTCAGGATATAAGTTTCC 

TTTATACATGATTGGACATGGTATGTTGGACTCCGATCTTCATCTTA 

GCTCAAAGGCTCAGGTGTTCGAACATCCTCACATTCAAGACGCTGC 

TTCGCAACTTCCTGATGATGAGAGTTTATTTTTTGGTGATACTGGGC 

TATCCAAAAATCCAATCGAGCTTGTAGAAGGTTGGTTCAGTAGTTG 

GAAAAGCTCTATTGCCTCTTTTTTCTTTATCATAGGGTTAATCATTG 

GACTATTCTTGGTTCTCCGAGTTGGTATCCATCTTTGCATTAAATTA 

AAGCACACCAAGAAAAGACAGATTTATACAGACATAGAGATGAAC 

CGACTTGGAAAGTAACTCAAATCCTGCACAACAGATTCTTCATGTT 

TGGACCAAATCAACTTGTGATACCATGCTCAAAGAGGCCTCAATTA 

TATTTGAGTTTTTAATTTTTATGAAAAAAAAAAAAAAAAACGGAAT 

TCACCCCACCAGTGCAGGCTGCCTATCAGAAAGTGGTGGCTGGTGT 

GGCTAATGCCCTGGCCCACAAGTATCACTAAGCTCGCTTTCTTGCT 

GTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCCAACTACTA 

AACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGCCTA 

ATAAAAAACATTTATTTTCATTGCAATGATGTATTTAAATTATTTCT 

GAATATTTTACTAAAAAGGGAATGTGGGAGGTCAGTGCATTTAAAA 

CATAAAGAAATGAAGAGCTAGTTCAAACCTTGGGAAAATACACTA 

TATCTTAAACTCCATGAAAGAAGGTGAGGCTGCAAACAGCTAATGC 

ACATTGGCAACAGCCCCTGATGCCTATGCCTTATTCATCCCTCAGA 

AAAGGATTCAAGTAGAGGCTTGATTTGGAGGTTAAAGTTTTGCTAT 

GCTGTATTTTACATTACTTATTGTTTTAGCTGTCCTCATGAATGTCTT 

TTCACTACCCATTTGCTTATCCTGCATCTCTCAGCCTTGACTCCACT 

CAGTTCTCTTGCTTAGAGATACCACCTTTCCCCTGAAGTGTTCCTTC 

CATGTTTTACGGCGAGATGGTTTCTCCTCGCCT 
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Table 12 (continued). Nucleotide sequence of plasmid pLP/VSVG. 

GGCCACTCAGCCTTAGTTGTCTCTGTTGTCTTATAGAGGTCTACTTG 

AAGAAGGAAAAACAGGGGGCATGGTTTGACTGTCCTGTGAGCCCT 

TCTTCCCTGCCTCCCCCACTCACAGTGACCCGGAATCCCTCGACATG 

GCAGTCTAGCACTAGTGCGGCCGCAGATCTGCTTCCTCGCTCACTG 

ACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCA 

CTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGC 

AGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACC 

GTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCC 

TGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAA 

CCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCC 

CTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTC 

CGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCT 

GTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTG 

TGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGT 

AACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCAC 

TGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAG 

GCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACAC 

TAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACC 

TTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCG 

CTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAG 

AAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCT 

GACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGA 

GATTATCAAAAAGGATOTCACCTAGATCCTTTTAAATTAAAAATG 

AAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGAC 

AGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTC 

TATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACT 

ACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATAC 

CGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCA 

GCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATC 

CGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT 

AGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAG 

GCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCC 

GGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCA 

AAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAA 

GTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAAT 

TCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGA 

GTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGT 

TGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCA 

GAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAA 

ACTCTCAAGGATCTTACCGCTGTTGA 
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Table 12 (continued). Nucleotide sequence of plasmid pLP/VSVG. 

GATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGC 

ATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGG 

CAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGA 

ATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGG 

TTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAAT 

AAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTG 

ACGGGATCCCCTGAGGGGGCCCCCATGGGCTAGAGGATCCGGCCT 

CGGCCTCTGCATAAATAAAAAAAATTAGTCAGCCATGAGC SBQID 

N0:8 
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WHAT IS CLAIMED IS: 

1. A method of producing an RNA molecule for use as an 

interfering RNA comprising: 

(a) identifying one or more target nucleic acid sequences; 

(b) preparing one or more nucleic acid molecules which 
encode one or more interfering RNAs, wherein said interfering RNAs bind to 
said one or more target nucleic acid sequences; 

(c) combining 

(i) one or more first nucleic acid molecules 
encoding one or more interfering RNAs flanked by one or more first type lis 
restriction enzyme recognition sites; 

(ii) one or more second nucleic acid molecules 
comprising one or more selectable markers flanked by one or more second 
type Us restriction enzyme recognition sites; and 

(iii) one or more site-specific type Us restriction 

enzymes; and 

(d) incubating said combination under conditions sufficient 
to join one or more of said nucleic acid molecules encoding one or more 
interfering RNAs and one or more of said second nucleic acid molecules, 
thereby producing one or more desired product nucleic acid molecules; 

(e) inserting said one or more product nucleic acid 
molecules into a host cell; and 

(f) expressing said one or more interfering RNAs in said 

host cell. 

2. The method of claim 1 , wherein said first and second restriction 
sites are the same. 

3. The method of claim 1 , wherein said first and second restriction 
sites are different. 
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4. The method of claim 1, wherein said first or second nucleic 
acid molecule is a vector. 

5. The method of claim 1, wherein said first or second nucleic 
acid molecule is a linear nucleic acid molecule. 

6. The method of claim 1, wherein said one or more selectable 
markers comprises at least one DNA segment encoding an element selected 
from the group consisting of an antibiotic resistance gene, a gene that encodes 
a fluorescent protein, an auxotrophic marker, a toxic gene and a phenotypic 
marker. 

7. The method of claim 6, wherein said antibiotic resistance gene 
is selected from the group consisting of a chloramphenicol resistance gene, an 
ampicillin resistance gene, a tetracycline resistance gene, a Zeocin resistance 
gene, a spectinomycin resistance gene and a kanamycin resistance gene. 

8. The method of claim 6, wherein said toxic gene is selected 
from the group consisting of a ccdB gene, a gene encoding a tus protein, a 
kicB gene, a^cB gene, an ASK1 gene, a OX174 E gene and a Dpnl gene. 

9. The method of claim 1 , wherein said first nucleic acid molecule 
and/or said second nucleic acid molecule further comprises one or more 
recombination sites. 

10. The method of claim 9, wherein said first nucleic acid molecule 
and/or said second nucleic acid molecule further comprises one or more 
topoisomerase recognition sites and/or one or more topoisomerases. 
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11. The method of claim 10, wherein said first nucleic acid 
molecule and/or said second nucleic acid molecule comprises two or more 
recombination sites. 

12. The method of claim 11, wherein said topoisomerase 
recognition site, if present, is flanked by said two or more recombination sites. 

13. The method of claim 12, wherein said recombination sites are 
selected from the group consisting of attB sites, attP sites, atiL sites, attR 
sites, lox sites, psi sites, tnpl sites, dif sites, cer sites, frt sites, and mutants, 
variants and derivatives thereof. 

14. The mpthod of claim 10, wherein said topoisomerase 
recognition site, if present, is recognized and bound by a type I topoisomerase. 

15. The method of claim 14, wherein said type I topoisomerase is a 
type IB topoisomerase. 

16. The method of claim 1 5, wherein said type IB topoisomerase is 
selected from the group consisting of eukaryotic nuclear type I topoisomerase 
and a poxvirus topoisomerase. 

17. The method of claim 1, wherein said expressed interfering 
RNA is between 35-60 nucleotides in length. 

18. The method of claim 17, wherein said expressed interfering 
RNA forms a hairpin loop. 

19. The method of claim 18, wherein said hairpin loop is between 
4-8 nucleotides in length. 
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20. The method of claim 19, wherein said hairpin loop comprises 
regions of complementarity that are between 1 8-25 nucleotides in length. 

21. A vector comprising: 

(a) one or more toxic genes; 

(b) one or more type lis restriction enzyme recognition 

sites; and 

(c) one or more site-specific recombination sites. 



22. The vector of claim 21, wherein said type lis restriction 
enzyme recognition sites are selected from the group consisting of Bsal, Bbsl, 
BbvJl BsmAI, BspMi, Eco3U, BsmBI, Bael, Fokl, Hgal, SfdNl and S*132L 

23. The vector of claim 21, wherein said recombination sites are 
selected from the group consisting of att& sites, attP sites, atfL sites, attR 
sites, lox sites, psi sites, tnpl sites, dif sites, cer sites, fit sites, and mutants, 
variants and derivatives thereof 

24. The vector of claim 21, wherein said vector further comprises 
one or more topoisomerase recognition sites and/or one or more 
topoisomerases. 

25. The vector of claim 24, wherein said molecule comprises two 
or more recombination sites. 

26. The vector of claim 24, wherein said topoisomerase recognition 
site, if present, is flanked by said two or more recombination sites. 

27. The vector of claim 24, wherein said topoisomerase recognition 
site, if present, is recognized and bound by a type I topoisomerase. 
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28. The vector of claim 27, wherein said type I topoisomerase is a 
type IB topoisomerase. 

29. A method of regulating the expression of one or more genes in 
a transgenic cell or a transgenic animal using interfering RNA, comprising: 

(a) identifying one or more target nucleic acid sequences in 
said cell or animal; 

(b) preparing one or more nucleic acid molecules which 
encode one or more interfering RNAs, wherein said interfering RNAs bind to 
said one or more target nucleic acid sequences; 

(c) combining 

(i) one or more first nucleic acid molecules 
encoding one or more interfering RNAs flanked by one or more first type lis 
restriction enzyme recognition sites; 

(ii) one or more second nucleic acid molecules 
comprising one or more selectable markers flanked by one or more second 
type lis restriction enzyme recognition sites; and 

(iii) one or more site-specific type lis restriction 

enzymes; and 

(d) incubating said combination under conditions sufficient 
to join one or more of said one or more nucleic acid molecules encoding one 
or more interfering RNAs and one or more of said second nucleic acid 
molecules, thereby producing one or more desired product nucleic acid 
molecules; 

(e) inserting said one or more interfering RNA-containing 
product nucleic acid molecules into said cell or one or more cells of said ' 
animal, under conditions such that said one or more interfering RNAs bind to 
said one or more target nucleic acid sequences, thereby regulating expression 
of said one or more genes. 
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30. The method of claim 29, wherein said expressed interfering 
RNA is between 35-60 nucleotides in length. 

31. The method of claim 30, wherein said expressed interfering 
RNA forms a hairpin loop. 

32. The method of claim 31, wherein said hairpin loop is between 
4-8 nucleotides in length. 

33. The method of claim 32, wherein said hairpin loop comprises 
regions of complementarity that are between 18-25 nucleotides in length. 

34. The method of claim 29, wherein said regulation results in 
decreased expression of said one or more genes. 
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, . . ACACOG GAGACC- ccdB- GGTCTCA TTTTTTTTCTASA . . . 
. .TGTGGCCTCTGO-ccdB-CCAGAGTAAAAAAAAGATCG. . . 



Bsol DIGESTION 



...A 

...TGTGG-P 



P-TTTTTTTTCTAGA. . . 
AAAAGATOG... 



SEQ ID H0:31 
SEQ ID N0:32 



SEQ ID N0:33 



CACCGNNNNNNNNNNNNNNNNNNN IN9FRT 20 10 

CNNNNNNNNNNNNNNNNNNNAAAA ,r,0Lm SEQ ID N0:35 
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RSV PROMOTER (1-229) 51 jr (1-410) 

5'SPUCE DONOR (520-520) 
PSL (521-565) 
RRE (1076-1317) 
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1 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
GAAAGGACGC AATAGGGGAC TAAGACACCT ATTGGCATAA TGGCGGAAAC TCACTCGACT 

61 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 
ATGGCGAGCG GCGTCGGCTT GCTGGCTGGC GTCGCTCAGT CACTCGCTCC TTCGCCTTCT 

121 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
CGC6GGTTAT GCGTTTGGCG GAGAGGGGCG CGCAACCGGC TAAGTAATTA CGTCGACCGT 

181 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATA CGCGTACCGC 
GCTGTCCAAA GGGCTGACCT TTCGCCCGTC ACTCGCGTTG CGTTAATTAT GCGCATGGCG 

241 TAGCCAGGAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG TCAGGATGGC CTTCTGCTTA 
ATCGGTCCTT CTCAAACATC TTTfiCGT TTT TCCGGTAGGC AGTCCTACCG GAAGAC GAAT 

RRN T2 terminator 

301 GTTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC ACCCTCCGGG CCGTTGCTTC 
CAAACTACGG ACCGTCAAAT ACCGCCCGCA GGACGGGCGG TGGGAGGCCC GGCAACGAAG 

361 ACAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA GGAGAGCGTT CACCGCCAAA 
TGTTGCAAGT TTAGGCGAGG GCCGCCTAAA CAGGATGAGT CCTCTCGCAA GTGGCTGTTT 

421 CAACAGATAA AACGAAAGGC CCAGTCTTCC GACTGAGCCT TTCGTTTTAT TTGATGCCTG 
GTTGTC TATT TTGCTTTCCG GGTCAGAAGG CTGACTCGGA AAGCAAAATA M CTACGGAC 

RRN Tl terminator 

M13 Rr(-2D) 

481 GCAGTTCCCT ACTCTCGCGT TAACGCTAGC ATGGATGTTT TCCCAGTCAC GACGTTGTAA 
CGTCAAGGGA TGAGAGCGCA ATTGCGATCG TACCTACAAA AGGGTCAGTG CTGCAACATT 
M13 For (-20) 

541 AACGACGGCC AGTCTTAAGC TCGGGCCCCA AATAATGATT TTATTTTGAC TGATAGTGAC 
TTGCTGCCGG TCAGAATTCG AGCCCGGGGT TTATTACTAA AATAAAACTG ACTATCACTG 

601 CTGTTCGTTG CAACAAATTG ATGAGCAATG CTTTTTTATA ATGCCAACTT TGTAC AAAAA 
GACAAGCAAC GTTGTTTAAC TACTCGTTAC GAAAAAATAT TACGGTTGAA A&TGTTTTT 

SENSE PRM 

pENTRla-462F hsU6-1wd 

U6 PROMOTER. 

661 AGCAGGCTTT AAAGGAACCA ATTCAGTCGA CTGGATCCGG TACCATfGGTC GGGCAGGAAG 
TCGTCCGAAA TTTCCTTGGT TAAGTCAGCT GACCTA6GCC ATGGTTCCAG CCCGTCCTTC 
SENS E PRIV 
hsU6-lw 
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U6 PROMOTER 

721 A6G6CCTATT TCCCATGATT CCTTCATATT T6CATATACG ATACAA6GCT GTTAGAGAGA 
TCCCGGATAA AGGGTACTAA 6GAAGTATAA ACGTATAT6C TATGTTCC6A CAATCTCTCT 

U6 PROMOTER 

781 TAATTAGAAT TAATTTGACT GTAAACACAA AGATATTAGT ACAAAATACG TGACGTAGAA 
ATTAATCTTA ATTAAACTGA CATTTGTGTT TCTATAATCA TGTTTTATGC ACTGCATCTT 

_IL6__P_RQM0J_ER_„ 

841 AGTAAT AATT TCTTGGGTAG TTTGCAGTTT TAMATTATG TTTTAAAAT G GACTAT CATA 
TCATTATTAA AGAACCCATC AAACGTCAAA ATTTTAATAC AAAATTTTAC CTGATAGTAT 
U6-PSE PROMOTER E 

USlSSOBSSL 

901 TGCTTACCGT AACTTGAAAG TATTTCGATT TCTTGGCTTT ATATATCTTG TGGAAAGGAC 
ACGAATGGCA TTGAACTTTC ATAAAGCTAA AGAACCGAAA TATATAGAAC ACCTTTCCT6 

+1 base transcription starts 
U6„ PROMOTER Notl 



961 


GAAACACCGG AGACCGCGGC 


CGCTGGATCC 


GGCTTACTAA 


AAGCCAGATA 


ACAGTATGCG 




CTTTGT6GCC TCTGGCGCCG 


GCGACCTAGG 


CCGAATGATT 


TTCGGTCTAT 


TGTCATACGC 






Bsal . 










1021 


TATTTGCGCG 


CTGAIIIIIG 


CGGTATAAGA 


ATATATACTG 


ATATGTATAC 


CCGAAGTATG 




ATAAACGCGC 


GACTAAAAAC 


GCCATATTCT 


TATATATGAC 


TATACATATG 


GGCTTCATAC 


1081 


TCAAAAAGAG 


GTGTGCTATG 


AAGCAGCGTA 


TTACAGTGAC 


AGTTGACAGC 


GACAGCTATC 




AGTTTTTCTC 


CACACGATAC 


TTCGTCGCAT 


AATGTCACTG 


TCAACTGTCG 


CTGTCGATAG 


1141 


AGTTGCTCAA 


GGCATATATG 


ATGTCAATAT 


CTCCGGTCTG 


GTAAGCACAA 


CCATGCAGAA 




TCAACGAGTT 


CCGTATATAC 


TACAGTTATA 


GAGGCCAGAC 


CATTCGIGI f 


GGTACGTCTT 


1201 


TGAAGCCCGT 


CGTCTGCGTG 


CCGAACGCTG 


GAAAGCGGAA 


AATCAGGAAG 


GGATGGCTGA 




ACTTCGGGCA 


GCAGACGCAC 


GGCTTGCGAC 


CTTTCGCCTT 


TTAGTCCTTC 


CCTACCGACT 














ccdB 


1261 


GGTCGCCCGG 


TTTATTGAAA 


TGAACGGCTC 


TTTTGCTGAC 


GAGAACAGGG 


ACTGGTGAAA 




CCAGCGGGCC 


AAATAACTTT 


ACTTGCCGAG 


AAAACGACTG 


CTCTTGTCCC 


TGACCACTTT 



ccdB 



1321 TGCAGTTTAA GGTTTACACC TATAAAAGAG AGAGCCGTTA TCGTCTGTTT GT6GATGTAC 
ACGTCAAATT CCAAATGTGG ATATTTTCTC TCTCGGCAAT AGCAGACAAA CACCTACATG 
ccdB 

1381 AGAGTGATAT TATTGACACG CCCGGGCGAC GGATGGTGAT CCCCCTGGCC AGTGCACGTC 
TCTCACTATA ATAACTGTGC GGGCCCGCT6 CCTACCACTA GGGGGACCGG TCACGTGCAG 
m ccdB 

1441 TGCTGTCAGA TAAAGTCTCC CGTGAACTTT ACCCGGTGGT GCATATCGGG GATGAAAGCT 
ACGACAGTCT ATTTCAGAGG GCACTTGAAA TGGGCCACCA CGTATAGCCC CTACTTTCGA 

FIG.12B 
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ccdB 

...Bsal., 

1501 66CGCATGAT GACCACCGAT AT6GCCAGTG TGCC6GTCTC CGTTATCGGG GMGAAGTGG 
CCGCGTACTA CTGGTGGCTA TACCGGTCAC ACGGCCAGAG GCAATAGCCC CTTCTTCACC 
ccdB 

1561 CTGATCTCAG CCACCGCGAA AATGACATCA AAAACGCCAT TAACCTGATG TTCTGGGGAA 
GACTAGAGTC GGTGGCGCTT 7TACTGTAGT TTTTGCGGTA ATTGGACTAC AAGACCCCTT 
ccdB Pol III terminato r 



1621 TATAAGGTCT CAIIIIIlll CTAGACCCAG CTTTCTTGTA CAAAGTT6GC ATTATAAGAA 
ATATTCCAGA GTAAAAAAAA GATCTGGGTC GAAAGAACAT GTTTCAACCG TAATATTCTT 

1681 AGCATTGCTT ATCAATTTGT TGCAACGAAC AGGTCACTAT CAGTCAAAAT AAAATCATTA 
TCGTAACGAA TAGTTAAACA ACGTTGCTTG TCCAGTGATA GTCAGTTTTA TTTTAGTAAT 

M13 Rev 

1741 TTTGCCATCC AGCTGATATC CCCTATAGTG AGTCGTATTA CATGGTCATA GCTGTTTCCT 
AAACGGTAGG TCGACTATAG G6GATATCAC TCAGCATAAT GTACCAGTAT CGACAAAGGA 
M13 Rev 

1801 GGCAGCTCTG GCCCGTGTCT CAAAATCTCT GATGTTACAT TGCACAAGAT AAAAATATAT 
CCGTCGAGAC CGGGCACAGA GTTTTAGAGA CTACAATGTA ACGTGTTCTA 77TTTATATA 

kanR 

1861 CATCATGAAC AATAAAACTG TCTGCTTACA TAAACAGTAA TACAA6GGGT GTTATGAGCC 
GTAGTACTTG TTATTTTGAC AGACGAATGT ATTTGTCATT ATGTTCCCCA CMTACTCGG 

kanR 

1921 ATATTCAACG GGAAACGTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT GATTTATATG 
TATAAGTTGC CCTTTGCAGC TCCGGCGCTA ATTTAAGGTT GTACCTACGA CTAAATATAC 
kanR . . .. 

1981 GGTATAAATG GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT CGCTTGTATG 
CCATATTTAC CCGAGCGCTA TTACAGCCCG TTAGTCCACG CTGTTAGATA GCGAACATAC 
. kanR 

2041 GGAAGCCCGA TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTA6CGTT GCCAATGATG 
CCTTCGGGCT ACGCGGTCTC AACAAAGACT TTGTACCGTT TCCATCGCAA CGGTTACTAC 
kanR 

2101 TTACAGATGA GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT CCGACCATCA 
AATGTCTACT CTACCAGTCT GATTTGACCG ACTGCCTTAA ATACGGAGAA GGCTGGTAGT 
kanR 

2161 AGCATTTTAT CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC CGCGGAAAAA 
TCGTAAAATA GGCATGAGGA CTACTACGTA CCAATGAGTG GTGACGCTAG CGGCCTTT7T 

FIG.12C 
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kanR 

2221 CA6CATTCCA GGTATTAGAA GAATATCCTG ATTCAGGT6A AAATATTGTT GATGCGCTGG 
GTCGTAAGGT CCATAATCTT CTTATAGGAC TAAGTCCACT TTTATAACAA CTACGCGACC 
_ kanR. , 

2281 CAGTGTTCCT GCGCC6GTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT AACAGCGATC 
GTCACAAGGA CGCGGCCAAC GTAAGCTAAG GACAAACATT AACAGGAAAA TTGTCGCTAG 
kanR 

2341 GCGTATTTCG TCTCGCTCAG GCGCAATCAC GMTGAATAA CGGTTTGGTr GATGCGAGTG 
CGCATAAAGC AGAGCGAGTC CGCGTTAGTG CTTACTTATT GCCAAACCAA CTACGCTCAC 
JcanR 

2401 ATTTTGATGA CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGM ATGCATAAAC 
TAAAACTACT GCTCGCATTA CCGACCGGAC AACTTGTTCA GACCTTTCTT TACGTATTTG 
k§nR 

2461 TTTTGCCATT CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT GATAACCTTA 
AAAACGGTAA GAGTGGCCTA AGTCAGCAGT GAGTACCACT AAAGAGTGAA CTATTGGAAT 
kanR 

2521 TTTTTGACGA GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA ATCGCAGACC 
AAAAACTGCT CCCCTTTAAT TATCCAACAT AACTACAACC TGCTCAGCCT TAGCGTCTGG 
kanR 

2581 GATACCAGGA TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT TCATTACAGA 
CTATGGTCCT AGAACGGTAG GATACCTTGA C6GAGCCACT CAAAAGAGGA AGTAATGTCT 
kanR 

2641 AACGGCTTTT TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG CAGTTTCATT 
TTCCCGAAAA AGTTTTTATA CCATAACTAT TAGGACTATA CTTATTTAAC GTCAAAGTAA 
kanR 

2701 TGATGCTCGA TGAGTTTTTC TAATCAGAAT TGGTTAATTG G7TGTAACAC TGGCAGAGCA 
ACTACGAGCT ACTCAAAAAG ATTAGTCTTA ACCAATTAAC CAACATTGTG ACCGTCTCGT 

2761 TTACGCTGAC UGACGGGAC GGCGCAAGCT CATGACCAAA ATCCCTTAAC GTGAGTTACG 
AATGCGACTG AACTGCCCTG CCGCGTTCGA GTACTGGTTT TAGGGAATTG CACTCAATGC 

dUC ori 

. '2821 CGTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT 
6CAGCAAGGT GACTCGCAGT CTGGGGCATC TTTTCTAGTT TCCTAGAAGA ACTCTAGGAA 
dUC ori 

2881 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT 
AAAAAGACGC GCATTAGACG ACGAACGTTT GTTTTTTTGG TGGCGATGGT CGCCACCAAA 
. dUC ori 

2941 GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC 
CAAACGGCCT AGTTCTCGAT GGTTGAGAAA AAGGCTTCCA TTGACCGAAG TCGTCTCGCG 



SUBSTITUTE Sri 




RULE 26) 



20/49 



puc ori 

3001 AGATACCAAA TACTGTCCTT CTAGTGTA6C CGTA6TTAG6 CCACCACTTC AA6AACTCTG 
TCTATGGTTT ATGACAGGAA GATCACATCG GCATCAATCC GGTGGTGAAG TTCTTGAGAC 
dUC ori 

3061 TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 
ATCGTGGCGG ATGTATGGAG CGAGACGATT AG6ACAATGG TCACCGACGA CGGTCACCGC 
dUC ori 

3121 ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GC6CAGCGGT 
TATTCAGCAC AGAATGGCCC AACCTGAG7T CTGCTATCAA TGGCCTATTC CGCGTCGCCA 
. dUC ori 

3181 CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC 
GCCCGACTTG CCCCCCAAGC ACGTGTGTCG GGTCGAACCT CGCTTGCTGG ATGTGGCTTG 
dUC ori 

3241 TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TGCCGAAGGG AGAAAGGCGG 
ACTCTATGGA TGTCGCACTC GTAACTCTTT CGCGGTGCGA AGGGCTTCCC TCTTTCCGCC 
dUC ori 

3301 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG 
TGTCCATAGG CCATTCGCCG TCCCAGCCTT GTCCTCTCGC GTGCTCCCTC GAAGGTCCCC 
pUC ori 

3361 GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT 
CTTTGCGGAC CATAGAAATA TCAGGACAGC CCAAAGCGGT GGAGACTGAA CTCGCAGCTA 
dUC ori 

3421 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT 
AAAACACTAC GAGCAGTCCC CCCGCCTCGG ATACCTTTTT GCGGTCGTTG CGCCGGAAAA 
dUC ori 

3481 TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
ATGCCAA6GA CCGGAAAACG ACCGGAAAAC GAGTGTACM 

FIG.12E 



SUBSTITUTE SHEET (RULE 26) 



21/49 




SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



23/49 




SUBSTITUTE SHEET (RULE 26) 



24/49 



5 '-UUCAGUGAGUAGAGUCAUATT- 3 1 SEQ ID N0:36 




S'-TTAAGUCAC 



CA 




CTCAGUAU-5 



SEQ ID NO: 37 



| "CORE" -) 

1 19 
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GENE OF INTEREST (GOI) 
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SEQUENCE LISTING 



<110> Invitrogen Corporation 
Chesnut, Jon 
Madden, Knut 
Dudas, Miroslav 
Leong, Louis 
Harris, Adam 

<12 0> METHODS AND COMPOSITIONS FOR PERFORMING SEAMLESS CLONING 
<130> 0942.581PC02 

<150> 60/493,322 
<151> 2003-08-08 

<160> 51 

<170> Patentln version 3.0 

<210> 1 

<211> 3519 

<212> DNA 

<213> Artificial 

<220> 

<223> pENTRU6 Vector 
<400> 1 

ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 
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cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240 

tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300 

gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360 

acaacgttca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420 

caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480 

gcagttccct actctcgcgt taacgctagc atggatgt'tt tcccagtcac gacgttgtaa 540 

aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600 

ctgttcgttg caacaaattg atgagcaatg cttttttata atgccaactt tgtacaaaaa* 660 

agcaggcttt aaaggaacca attcagtcga ctggatccgg taccaaggtc gggcaggaag 720 

agggcctatt tcccatgatt ccttcatatt tgcatatacg atacaaggct gttagagaga 780 

taattagaat taatttgact gtaaacacaa agatattagt acaaaatacg tgacgtagaa 840 

agtaataatt tcttgggtag tttgcagttt taaaattatg ttttaaaatg gactatcata 900 

tgcttaccgt aacttgaaag tatttcgatt tcttggcttt atatatcttg tggaaaggac 960 

gaaacaccgg agaccgcggc cgctggatcc ggcttactaa aagccagata acagtatgcg 102 0 

tatttgcgcg ctgatttttg cggtataaga atatatactg atatgtatac ccgaagtatg 1080 

tcaaaaagag gtgtgctatg aagcagcgta ttacagtgac agttgacagc gacagctatc 1140 

agttgctcaa ggcatatatg atgtcaatat ctccggtctg gtaagcacaa ccatgcagaa 1200 

tgaagcccgt cgtctgcgtg ccgaacgctg gaaagcggaa aatcaggaag ggatggctga 1260 

ggtcgcccgg tttattgaaa tgaacggctc ttttgctgac gagaacaggg actggtgaaa 1320 

tgcagtttaa ggtttacacc tataaaagag agagccgtta tcgtctgttt gtggatgtac 1380 

agagtgatat tattgacacg cccgggcgac ggatggtgat ccccctggcc agtgcacgtc 1440 

tgctgtcaga taaagtctcc cgtgaacttt acccggtggt gcatatcggg gatgaaagct 1500 

ggcgcatgat gaccaccgat atggccagtg tgccggtctc cgttatcggg gaagaagtgg 1560 

ctgatctcag ccaccgcgaa aatgacatca aaaacgccat taacctgatg ttctggggaa 1620 

tataaggtct catttttttt ctagacccag ctttcttgta caaagttggc attataagaa 1680 

agcattgctt atcaatttgt tgcaacgaac aggtcactat cagtcaaaat aaaatcatta 1740 

tttgccatcc agctgatatc ccctatagtg agtcgtatta catggtcata gctgtttcct 1800 

ggcagctctg gcccgtgtct caaaatctct gatgttacat tgcacaagat aaaaatatat 1860 

catcatgaac aataaaactg tctgcttaca taaacagtaa tacaaggggt gttatgagcc 1920 

atattcaacg ggaaacgtcg aggccgcgat taaattccaa catggatgct gatttatatg 1980 

ggtataaatg ggctcgcgat aatgtcgggc aatcaggtgc gacaatctat cgcttgtatg 2040 
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ggaagcccga 


tgcgccagag 


4- 4- «-<4- 4- 4- /"« hna 

tLgtuCCtya 


aapafarrraa a rrrrt" a ptpttI - t~ ciP , p , aa4TCfa.t"pr 
daCdUyy tad rt.yyua.yL.yuL. yuv-dduyduy 


2100 


ttacagatga 


gatggtcaga 


^m. 1* a «*t 4-* /"*V /T t~* 

ctaaac uggc 


frraprirraaH- +* ia 1" c t~ t" proarr'atra 
uy duyy aai u uciuyi~L.uL>Lu uu>yauuctL.u.ci 


2160 


agcattttat 


ccgtactcct 


gatgatgcat 


r~rrr 4-4-ra/*«4-(^ , n«*^ r"*n ^™»t"fTrTTsa r^* ^PTrra aasa 
y y u UdCUUdL. UdL.uyL.ydUU uuuyy aaaaa 


222 0 


cagcattcca 


ggtattagaa 


gaauaucc ug 


duuudyyuyd dddUduuyuu yauyuyuuyy 


2280 


cagtgtxcct 


gcgccggttg 


»S ^ 4h MHM 4t" 4" /** 


Cty uuuy uaa uuyuuuuuuu v^v- 


2340 


gcgtatttcg 


ucucgcucag 


gcgcaatcdc 


ydduyddUdd uyyuuuyyuu yauyuyay uy 


2400 


atttgatgac 


gagcgtaatg 


gcuggccugu 


UyadUddyUU uyydddyddd uy^dUdctauL. 


2460 


tttgccattc 


tcaccggatt 


cagtcgtcac 


ucauggugau uucucduuuy dudduuuudu 


* J* v 


ttttgacgag 


gggaaat taa 


taggttgtat 


ugauguugga cgagucggaa ucgcagaccy 




ataccaggat 


cttgccafccc 


tatggaactg 


CCuCggugag UUUUCUCCUU CattaCdyad 


Zytv 


acggcuu.uL.Ti 


caaaaa ua ug 


r**4* ^ 4~ 4— a 4~ a a 
gudLugaLdd 


UUUUydUduy dctudctctu uy u oy ululol lu 


2700 


gauge ucgau 


yaytutLtCt 


aa+*r< srra si +- 4- 
aatCagaaUt 


yyuudduuyy uuyudduduu yyuayayuoi. 


2760 


uacgcugacu 


4™ ft a /~*ftfr s /^r 

ugacgggacy 


gcgcaagc uc 


duydUUdddd uuuuuudawy i.y»yi'U»uy^ 


2820 


gfccgttccac 


tgagcgtcag 


accccgtaga 


aaagaucaaa ggauuuucuu ydyduuuuuu 


2880 


ttttctgcgc 


guaaucugcc 


gcttgcaaac 


aaaaaaacca ccgcuaccag cyguyguuuy 




tttgccggat 


caagagctac 


caactcctt t 


uccgaaggua acuggcuuca gcagagcgca 


JUUV 


gataccaaat 


actgtccctc 


uag uguagcc 


^4- i^t4- 4~ a r~ir~i pancapffra a rra a p* 4"<™f f~ 

guaguudyyc Cdccdcuuud ayddcuuuyu 


JuDu 


agcaccgccu 




CtCL.yCC.ddU 


uuuyuuctuud y LyyvLyLuy \»v»yi.yy^y» 


3120 


taagtcgtgt 


c u uaccyggu 


tyyaCLCadg 


a nrta f* a prt" 4~ a ppp;rrat"aann p»pTP , aprr , CfPi*r P* 
duydUdyuud uuyyduddyy uyuayuyy 


3180 


gggctgaacg 


gggggttcgt 


gcacacagcc 


cagcuuyydy cgaduydcuu duduuydduu 




gagacaccua 


cagcgugagc 


auugagaaag 


r+ rt r« r< a rtr* t* 4* p pr«p;a a ppp;a pra a a PTPTP*PrPf a 

cgccacgcuu cccgaagyyd gaaaggcggd 


J o U v 


cagguauccg 


y Ldagcyycd 


ft ft ft 4» r**r~* a ^ ft 

ggg u cggaac 


apparrappnn a pp/a rrnrra pjp 4~ t" P'P'aPtPlPTPrpr 

ayyagagcgc acgagggagc ctccaggggg 




aaacgcctgg 


tatctttata 


gtcctgtcgg 


gtttcgccac ctctgacttg agcgtcgatt 


3420 


tttgtgatgc 


tcgtcagggg 


ggcggagcct 


atggaaaaac gccagcaacg cggccttttt 


3480 


acggttcctg 


gccttttgct 


ggccttttgc 


tcacatgtt 


3519 



<210> 2 

<211> 8688 

<212> DNA 

<213> Artificial 
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<220> 

<223> pLenti6/V5-DEST 
<400> 2 



aatgtagtcc 


uatgcaauac 




+- f- /-t /-i a ar^a^^f 
tcyCaaCatg 


rr f* a a pcra i* era 


attaaraaca 


60 


tgccttacaa 


ggagagaaaa 


ayCaCCyuyv. 


dL.y ^ a. l. uy 


otrreiaacf t aa 

y ^yynav^ u>c%o. 


aataatacaa 


120 


X* «-f 1* /i 4— 4— n 

ccgtgcccua 




a a r"arra r*nn(~i 




a i" era a c* era a 


ccactcaatt 


180 


gccgcau ugc 


agagauauuy 


t*at*t-f aa ofrn 
taULLaay i-y 


uciy l. LLyo 




aatctctctcf 


240 


gu cagaccag 


ac c ugagc c u 


yyyttyCtt.tL 


t*nrf(^f*aap^a 
LygL- LctdL. Lo 


a etna a cccac 




300 


tcaataaagc 


ttgccfctgag 


tgccccaagc 


agcg cgLgccy 


L.y ULuy uiyi- 


rff*rta pt f t OCT 
y ^y Ct^» • »— • Lyy 


360 


fcaactagaga 


tccc ucagac 


^^♦~^+"^arTt"r'' 
CCLLULay t.c 


af*i^*rTi~<?ffTaaa 
ay uy uy y actci 




ataaccrccca 


420 


aacagggact 


ugaaagcgaa 


ag g g a aa^ u a. 


y c*y y cty l. l. l» u 


p4- caaccicaa 


aactcaactt 


480 


gc tgaagcgc 


gcacggcaag 


aggcgagggg 


uyy LyaL uy y 


uy ay i»a\<.y 


aaaaatttta 


540 


actagcggag 


gc c agaagga 


gagagacggg 


uycyay ay L.y 


f* r* a cr t* a "h t* a a 

ULiOyifCLL L. CtCl 


y ^yyyyy^y a 


600 


at tagatcgc 


gatgggaaaa 


aauccgguca 


a /tot/*' a / it'%nt n s 

aggccagggg 


rra a a rra a a a a 
yadnyciaaaa 


ciu.ctocictca.i_ o c* 


660 


aaacatatag 


tatgggcaag 


cagggagc^a 


yddcydLLcy 


r>arrt" t" aa+*rr 
U ay LLaaL-LU 


yy y ^.a 


720 


gaaacatcag 


aaggc uguag 


dCaddUdCty 


yydLdyLudL 






780 


tcagaagaac 


4r 4™ /^r a +■* /""i a 4-* 

LLdyaL CaUC 




rrt" a rrr»a acrp 
y l ay LaaLtu 


4- r't a thai* at 

ttuai. uy uy u 


acat caaaaa 


840 


auagagauaa 


a a *-*e a ~> /-» — i 3 

aayaCaCCaa 


rrrra a rrf»+" +■ 1r a 
yy dayCtLLd 


y Qifaay ct i_<xy 




aaacaaaaat 


900 


aagacuaucg 


—\ ri a a t-tf~* 


y g c y c u y ct l. 




yyayyayyay 


atataaacraa 


960 


caauuggaga 


a nf- /-va ^ f* 1* 1 

ayuyaaLUciu 


a4"aaaf~a^aa 


a rrt~ a rr*r a a a A 
ay Lay laaaa 


ahhaaappat" 
aLLyaatvcib 


t acraaataac 


1020 


acccaccaag 


gcaaagagaa 


gagcggugca 


na na na a a a a 
y cty cty dadaa 


ay ay way uyy 


yoai< ay y ay v_. 


1080 


f- 4- 4- ^rf- f- 4- f- 

LCUyLtCCCC 


gggctcccgg 


gagcagcagg 


ddyLdL tduy 


yyuy Lay L»y u 


waa Lyovyu u. 


1140 


gacggtacag 


gccagacaat 


x# a 4** 4* ^ 4* ^t^^ 

uatcgcccgg 


LdLciytyCay 


r»arrparfa a/^a 
CdyCaydaCd 


af 4*4« rrPhrta rr 
aCLLyLtydy 


i 9 on 


ggctattgag 


gcgcaacagc 


atccgucgca 


acucacaguc 




ayLdyL L.L.ua 


X_Ov 


ggcaagaatc 


ctggctgtgg 


aaagatacct 


aaaggatcaa 


cagctcctgg 


ggatttgggg 


1320 


ttgctctgga 


aaactcattt 


gcaccactgc 


tgtgccttgg 


aatgctagtt 


ggagtaataa 


1380 


atctctggaa 


cagatttgga 


atcacacgac 


ctggatggag 


tgggacagag 


aaattaacaa 


1440 


ttacacaagc 


ttaatacact 


ccttaattga 


agaatcgcaa 


aaccagcaag 


aaaagaatga 


1500 


acaagaatta 


ttggaattag 


ataaatgggc 


aagtttgtgg 


aattggttta 


acataacaaa 


1560 


ttggctgtgg 


tatataaaat 


tattcataat 


gatagtagga 


ggcttggtag 


gtttaagaat 


1620 



agtttttgct 


gtactttcta 


tagtgaatag 
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agttaggcag 


ggatattcac 


cattategtt 


1680 


tcagacccac 


ctcccaaccc 


cgaggggacc 


cgacaggccc 


gaaggaatag 


aagaagaagg 


1740 


tggagagaga 


gacagagaca 


gatccattcg 


attagtgaac 


ggatctcgac 


ggtatcgata 


1800 


agcttgggag 


ttccgcgtta 


cataacttac 


ggtaaatggc 


ccgcctggct 


gaccgcccaa 


1860 


cgacccccgc 


ecattgaegt 


caataatgac 


gtatgttccc 


atagtaaege 


caatagggac 


1920 


tttccattga 


cgtcaatggg 


tggagtattt 


aeggtaaact 


gcccacttgg 


cagtacatca 


1980 


agtgtatcat 


atgccaagta 


cgccccctat 


tgacgtcaat 


gaeggtaaat 


ggcccgcctg 


2040 


gcattatgcc 


cagtacatga 


ccttatggga 


ctttcctact 


tggcagtaca 


tetaegtatt 


2100 


agtcatcgct 


attaccatgg 


tgatgcggtt 


ttggcagtac 


atcaatgggc 


gtggatagcg 


2160 


gtttgactca 


eggggattte 


caagtctcca 


ccccattgac 


gtcaatggga 


gtttgttttg 


2220 


gcaccaaaat 


caaegggact 


ttccaaaatg 


tegtaacaac 


tccgccccat 


tgacgcaaat 


2280 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


tataagcaga 


getegtttag 


tgaacegtea 


2340 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat 


agaagacacc 


gactctagag 


2400 


gatccactag 


tccagtgtgg 


tggaattctg 


cagatatcaa 


caagtttgta 


caaaaaagct 


2460 


gaacgagaaa 


cgtaaaatga 


tataaatatc 


aatatattaa 


attagatttt 


gcataaaaaa 


2520 


cagactacat 


aatactgtaa 


aacacaacat 


atccagtcac 


tatggeggee 


gcattaggca 


2580 


ccccaggctt 


tacactttat 


gcttccggct 


cgtataatgt 


gtggattttg 


agttaggatc 


2640 


cggcgagatt 


ttcaggagct 


aaggaagcta 


aaatggagaa 


aaaaatcact 


ggatatacca 


2700 


ccgttgatat 


atcccaatgg 


categtaaag 


aacattttga 


ggcatttcag 


teagttgetc 


2760 


aatgtaccta 


taaccagacc 


gttcagctgg 


atattaegge 


ctttttaaag 


acegtaaaga 


2820 


aaaataagca 


caagttttat 


ccggccttta 


ttcacattct 


tgcccgcctg 


atgaatgetc 


2880 


atccggaatt 


ccgtatggca 


atgaaagacg 


gtgagctggt 


gatatgggat 


agtgttcacc 


2940 


cttgttacac 


cgttttccat 


gagcaaactg 


aaacgttttc 


ategctctgg 


agtgaatacc 


3000 


acgacgattt 


ccggcagttt 


ctacacatat 


attegcaaga 


tgtggcgtgt 


tacggtgaaa 


3060 


acctggccta 


tttccctaaa 


gggtttattg 


agaatatgtt 


tttegtctea 


gccaatccct 


3120 


gggtgagttt 


caccagtttt 


gatttaaacg 


tggecaatat 


ggacaacttc 


ttcgcccccg 


3180 


ttttcaccat 


gggcaaatat 


tataegcaag 


gcgacaaggt 


getgatgecg 


ctggcgattc 


3240 


aggttcatca 


tgccgtctgt 


gatggcttcc 


atgtcggcag 


aatgcttaat 


gaattacaac 


3300 


agtactgcga 


tgagtggcag 


ggcggggcgt 


aaagatctgg 


ateeggctta 


etaaaageca 


3360 


era haar acrl- a 


t ac*a t* a t" 1 1 a 


r* cr f cs p t" er a t~ t~ 




a a era a t* a t~ a fc 


a a fc era t a t crt- 


3420 


atacccgaag 


tatgtcaaaa 


agaggtgtgc 


tatgaagcag 


cgtattacag 


tgacagttga 


3480 
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cagcgacagc 


uaucagutgc 


4- a a rrri naha 

Lcaaggcaua 


i~ a or a 1~ cr h r* a 

LCI L.y ct Ly LLO 


a^atctGPQcr 


LL Lyy uauyv^ 


3540 


acaaccatgc 


agaatgaagc 


ccgtcgtcfcg 


r~*n 4~ rT o p rra a ^ 
CyLgCLyddL 


y CLyy etacty l 


crn a aaahfacr 
yyaaaaLLay 


3600 


gaagggatgg 


ctgaggtege 


ccggcc uacc 


/♦a a a 4~ **ra a /~*rr 

gaaa ugaacg 


yCULLLL uyL 


Ly dLy ety aaL 


3660 


agggactggt 


gaaatgcagt 


+- 4- a a t~t r~t 4- 4~ +r a 

LLoayyttta 


r*ar"»r"<+~a+"aaa 
CdCCLdLddd 


agay ay ay LL 


crt" hat* ccrl" c t" 
y uuctLLy ull 


3720 


gtccgcggac 


#"» f* a *"» a rra 

ytaCayaguy 


af/at"h af frra 
dLdULdLtya 


c* a r*cfr* f* cetera 
LctLy Lv-Ly yy 


Ly ot-yy ciuyy 


•r a a f* c rr cr t 

LMQLLLvLL L 


3780 


ggcca.gu.gca 


cgee uy c tyt 


fi a r^a t* a a a /^rt - 
UdydLdddy t. 


L LLLLy Lyctct 


LLLuaLLLyy 


fr*r*fcrt*erc , a t" a r 
Lyy Ly La La l 


3840 


cggggatgaa 


a #t /■» f* ft r* a 

age egg eye a 


ugauyaccac 


frrafat" cine r* 

cyauauyycL 


a n t~ cr *r cr r 1 c act 
cty Ly Ly LLy y 


LL L L Ly L LaL 


J ^7 \J \J 


cggggaagaa 


ytggc Lyatc 


ucayccauLy 


pnaaaa 4- na r" 
Lyctct etc* Ly ctL 


aLLaaaaaLy 


rr , ai*haarrh 
l l a l l aaL l l 


3960 


gauge uctgg 


ggaa uauaaa 


cgucaggc lc 


rrrt*l"at*apan 
Cyt La Let Let L 


a err* f* a crt* r» V cr 
cty L Lay ll Ly 


c a nnt* nnarr 
l ay y l l y »l l 


4020 


atag cgacLg 


yataty uty t 


yLLLtacayu. 


a ♦* t*a +■ a rrt* 
ct l La Ly Lay l 


LLyLLLLLLa 


t~ nca a a a t~ c t* 
Ly Laaaa ll l 


4080 




a*rt-Cfat-a-h+--t- 


atoLCdtLLl 


dLyLLLL LL>y 


L LLay L- L L LL 


^" t" at* anaaan 
l Ly LaLaaay 


4140 


+"ncrt~*rcra'pai" 


ui>ay LdLdy l 


y y cyy uty v- u 


n ci a cr c* i" a cr 3 
Ly ay l l Lay a 


yyyLLLy Lyy 


t~ t" CCra a era t a 
L u Ly aay y l a 


4200 


dytccauccc 




LLLyyuLLLy 


Ct L LL LCLLy Ly 


LaLLy y LLay 


Laa Ly ay l l l 


4260 


^■r^"ra ^ 4™ 4™ 

ggaaccaauc 


/r 4~ ci 4~" *~tr*r a a 4" j**t 

ccgrggaaug 


4~ <^r4" ^»4" ^ a 4" 4™ 

v-gugucay t.t 


agggngcgga 


day LLLLLay 


y l LLLLLay y 


4T90 


caggcagaag 


caugcaaagc 


aegea tc cca 


a 4» 4" a ^r4" a <™*r^ 

at l ague age 


ddccdyytyt 


ygaaay llll 


*± J O V 


caggc ccccc 


agcaggcaga 


ag Ldtycddd 


y La Ly Let LL L 


Lad l Lay l La 


rre'a a rrahan 
yLaaLLatay 


4440 


Lcccgccec l 


a a /"* r*r+rir* r* c 

aactccyccu 


aLLLLyLLLL 


LdOL L L Ly L L 


ca erf* t" fence 
Lay l> ll Ly ll 


LaL LL LLLyL 


4500 


cccatggc ug 


actaatuccL 


4- f- i- 4- 4- 4- x 4- rr 

LLLdL LLdty 


/ 

Lcty ctyy LLy ct 


yyLLyLL ll l 


crcc ,< r r"" l*na 
y ll ll Ly ay l 




taCUCCayaa 


y Lay tyoyya 


yyuLLLLLLy 


g«*yycL uayg 


pi"f"t'f'nr , aaa 
l LLLLyLdaa 


aanpfppp crcr 
ddy l LLLLyy 


4620 


rfan^f 1 +- /~f 4~ * 4- 


dtccattccc 


fin a +" nfrrfl 4- /— < 

ggatctyotc 


a nr*a rT4~ rri" 4- 

ayudcy LytL 


era paahhaal" 
ydLddLLddL 


pat* prrpfpah a 

LdLLyyLdLd 


4fifi0 

lOOV 


nt* a t* a f~ ncscic 
y Ld uaL.L>y^L 


af a rr+- a 4- a a f- 

a Lay Ldtaau 


a /""rra r^a a rrrrt - 
aLyaLdayy l. 


rra crcra arhaa 
y ay y aaL Laa 


a fpahcrapra 

GLLOtyyLLfl 


ay LLLLLyLL 


4740 


l l d ay aay ct a. 


LOLaUUL. L La 


Lty adayay L- 


a a rrrr 1" a e a 
ddLyy l La La 


aLLdaLay La 


t*c , c , cc , at"pt"f* 

LLLLLaLL LL 


4 8 00 

"i a \j v 


hnaanar'ha c 

tgaayao uaL 


dytytt-yLCa 


y c g c ay lull 


L ll Lay Ly ctL 


y y LLy l a ill 


t* c* a c 1 +■ crcr f" cr f* 
LLdLLyyLyL 


*± a o v 


f» a a t" nt* ahah 

caatguatdL 


pa t" 4* f* t* aot*n 
tdt.Lt.CdCtg 


gggg acc ttg 


Ly Lay ctaL ll 


gtggtgcLgg 


npapi*pipt*rfp 
y LdL Ly l uy l 




ugcugeggea 


gc uggeaace 


4- a /™» 4" 4— rr 4" a t" 

LgacLLgudu 


eg Lcgcgatc 


rrr*r a a a 4— /-« a /*r a 

ggaaa ugagd 


ana ftfirtryr* a 4~ 

acaygggcaL 


q q n 


^ ^ 4— <"«r a *"r f» ^ 

CttyayCCCC 


tgcggacggu; 


gccgacaggc 


4- 4~ <^ 4" rirta 4- 

gc l lc ccyac 


4™ ti r+ a 4- ri 4~ 
CLyCdLCCLg 


rrrfa t~ a a arm 

ggate adage 




c a l ag tgaag 


gacagtga u,g 


gacagccgac 


ggcagccggg 


a 4" 4— /*»^»4" /-» a a +* 

aUt.CgL.yaaL 


4- r* <*^4~ /t/™* /-i 4~ 

tge egcee lc 


D±U\J 


tggttatgtg 


tgggagggct 


aagcacaatt 


egagcteggt 


acctttaaga 


ccaatgactt 


5160 


acaaggcagc 


tgtagatctt 


agecactttt 


taaaagaaaa 


ggggggactg 


gaagggctaa 


5220 


ttcactccca 


acgaagacaa 


gatctgettt 


ttgcttgtac 


tgggtctctc 


tggttagacc 


5280 



7/49 

agatctgagc ctgggagctc tctggctaac tagggaaccc actgcttaag cctcaataaa 5340 

gcttgccttg agtgcttcaa gtagtgtgtg cccgtctgtt gtgtgactct ggtaactaga 5400 

gatccctcag acccttttag tcagtgtgga aaatctctag cagtagtagt tcatgtcatc 5460 

ttattattca gtatttataa cttgcaaaga aatgaatatc agagagtgag aggaacttgt 5520 

ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag 5580 

catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg 5640 

tctggctcta gctatcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc 5700 

cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 5760 

ctcggcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc tagggacgta 5820 

cccaattcgc cctatagtga gtcgtattac gcgcgctcac tggccgtcgt tttacaacgt 5880 

cgtgactggg aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc 5940 

gccagctggc gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc 6000 

ctgaatggcg aatgggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 6060 

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc 6120 

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 6180 

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat 6240 

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc 6300 

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc 6360 

tattcttttg atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg 6420 

atttaacaaa aatttaacgc gaattttaac aaaatattaa cgcttacaat ttaggtggca 6480 

cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata 6540 

tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga 6600 

gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc 6660 

ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg 6720 

cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc 6780 

ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat 6840 

cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact 6900 

tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat 6960 

tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga 7020 

tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc 7080 

ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga 7140 
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tgcctgtagc 


<5 ?a 4- jt/t p^ a a p* a 

sa tyy LdaLd 


a /~" prfc fc ptpti pa 
dLy LLyLyLd 


dCIL Ld L LCLCIL 


fcaacaaacta 


cttactctag 


7200 


cttcccggca 


«a yi a af*haa^a 
aCodLLadLa 


era t* ptpt a fc no 
y dL uyy d Ly y 


a np/pnrra f*aa 
dyy Lyy d Ldd 


a cr fc fc ci c aerer a 

cty l uyuayya 


ccs.cht.ct.cxc 

W fc* w w W W^j w 


7260 


gcLcggccct 


tccyy l Lyy l 


fc acr t h fc a fcfc a 

Lyy l l l d l Ly 


r* t - era i* a a a t* c 

L LyctLdClCXLL 


fc cr a aa c c ercr fc 


aaacataoat 

3 R 3 W 3 ^333 


7320 


c fccgcggtat 


^ l-frfra etc* a 


c fc rrrrrrrfr fan 
l Ly y y y LLdy 


ciuyy Lddy ll 


L L L L L y L Cl L L 


afcaatfcatct 

w w VA W Nh-» J** 


7380 


acacgacggg 


gagtcaggca 


oCtdtyydty 


a a pna a a f*a t~f 
ddLy dddLdy 


a r»artafcr*nrpfc 
dLdy dLLy l l 


eracra fc acrerfcn 
y dy a l dy y Ly 


7440 


n/if i»«a 4— 

ccccacugau 


4- a arfr»a fc t" ptpt 

taa.gcciL.ugg 


t* a a pfcerfc paer 
LddL Ly LLdy 


CtL L Cl dy L L Ld 


pt*pafcafcafca 

L L L d LCt L C*. L O. 


ctttaaatta 

w w w w t*y » ^ y 


7500 


a ttfcaaaac t 


tCdUCLLtda 


fc fcfc a a a a nna 
LLLddddyyd 


ll Ldy y uy dd 


CTafcppfcfcfcfcfc 

ydLLLLLLLL 


oataatctca 


7560 


Ly dLLdddd L 


LLLt UClClLy L 


craerfcfcfc fc pafc 
y c*y L L L L L y L 


c c a e; t er a cr c 


atcactacccc 

W Vw' C* ~n w \* 


gtagaaaaga 


7620 


fc r* a a a pt/ra t" O 
LLddctyyctuL 


fcfc pfcfcaacra fc 
lull Ly dy d L 


cpfcttttfctc 

LLLLLLLLLL 


taccrcataat 


ctgctgcttg 


caaacaaaaa 


7680 


aacnar 1 t~*c\r* \~ 
ddL.t.aLLyv' l 


a rT*a rTPPrrrfcer 
dLLdy Lyy Ly 


CTfcfctafcfcfcap 

y l l l y l l l 


r*cf era t - e; a a a a 


gctaccaacfc 


ctttttccga 


7740 


agg uaactyg 


LL UUdy Lay a 


nnrrrana hap 
y Ly Ldy d lcil 


n a a a t* ar t* at" 

L CICtCl LdL Ly L 


tcttctaatcr 


taaccataat 


7800 




L L LLddy aat 


ll uy Ldy L CtL 


nac pi - a p a t a 

L y L L LUwuLQ 


ccfcccrctctQ 


ctaatcctgt 


7860 


r* r* a pt+* /-rrt P 1 


LyLLyLLay l 


rf rmera faa cf fc 
y y Ly d Lddy l 


pert - crt*ptfcap 

Ly Ly Lv L LCI L 


c cfcrcf fc fc era a c 

333 33 


tcaagacgat 


7920 


a t" a rrrrrra 

ayt uaccyyo 


fc a a rrcrrrtr' a pt 
Lddy y Ly Ldy 


pfffif- r , PTPTrrr«fc 
Lyy l Lyy y l l 


era a pcrercr craef 
y ddLyyyyyy 


tticcffccrcaca 


cagcccagct 


7980 


i-yy dy ty ctac 


yctLLLca.LC3.LL 


era a p fc na pra fc 
y ddL Ly dy a l 


acctacaaca 


tgagctatga 


gaaagegeca 


8040 


Cyv ULt-utyd 


arrrrrra rra a an 

dyyy dy dddy 


rrrrrcra pa ercrfc 
y LyydLdyy l 


a fc p pcrcr t a acr 

d LLLy y L«ay 


c ocr c acrcr cr fc c 

33 3 j3 


aaaacaaoaa 

y y aa^ay y «y 


8100 


a rir^ pyp< a orra pt 

agcgcacgag 


yyagcLLLCci 


nmrrrTrra a a ppi 

ggggg aaac s 


ppfccrerfc afcpfc 


fcfcafcaafc ret 

L L Cl L »y L L L L 


atcaaatttc 


8160 


cine* a p~< p» o 4~ it 


ctL l Ly ay Ly l 


perat- fcfc fcfc afc 

LydLLLLLyL 


er a fc crpfc pcrfc p 
yctLyLLLyLL 


a ercr cfcrcra p era 


aacctataaa 

C4.WJ W» W w4 fc jM q 


8220 


a ^ a a r*rtr+ r> a r 

ddddCyCCay 


c cXdcy Ly y ll 


t-hf ap rrrr fc 
L L L L LdLy y L 


fc e , r*fconr , r , t" fc 

LLL LyyLLL L 


fc fc er p fc ererp p fc 
L Ly L Ly y ll l 


fc fc tachcaca 

L ~»y L W> V* w U. 


82 80 


fcerfc fcpfcfc trr 


LyLyLLClLLL 


r 1 p t" era fc fc p fc a 

LLLyCtLLLLy 


t"crerafcaacccf 


tafc fcaccacc 

w WL W W w* w w >-j W V/ 


fcfctaaataaa 

« ^ w y t*y ^"y**y 


8340 


p fc era fc a p p a p 
l uy acciv/Ly v< 


fc car* ncrc aer p 
l Ly l l y Lay l 


y cmv^j aw ^"y 


aacacaacaa 


atcaataaac 


aaaaaaacaa 

3«33""3 v "33 


8400 


ctcty dy Ly l 


aaf a rrfpaa a 
a a l a l y l ad cl 


LLyLL LL LLL 


ppcfpcrpcrfcfccr 

LLyLyLyLLy 


appaafcfccafc 

y L Ly Cl L LLCaL 


taatacaact 


8460 


nrrr* a nna parr 
yy uaLy dLdy 


nfc fc fc pppna r* 

y L L L LL Ly dL 


fc rrrra a a erpcrcr 
i-yydddyvyy 


crp a cifc aa crpcr 
y Lay Ly cty Ly 


l cicx l y l aa l l 


aatataaafcfc 

aw ^*y »y 3 


8520 


agctcactca 


ttaggcaccc 


caggctttac 


actttatget 


tccggctcgt 


atgttgtgtg 


8580 


gaattgtgag 


eggataacaa 


tttcacacag 


gaaacagcta 


tgaccatgat 


fcacgccaagc 


8640 


gcgcaattaa 


ccctcactaa 


agggaacaaa 


agctggagct 


gcaagctt 




8688 



<210> 3 
<211> 6964 
<212> DNA 
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<213> Artificial 



<220> 

<223> pLenti6/V5-dT0PO™ 
<400> 3 



aatgtagtct 


tatgcaatac 


tcttgtagtc 


ttgcaacatg 


guaacgauga 


frt* t" a err* a a f 3 

y idyCftato 




tgccttacaa 


ggagagaaaa 


agcaccgtgc 


atgccgattg 


guggaaguaa 


y y Lyy ccicy a. 


X A \> 


ucgtgccuua 


ttaggaaggc 


aacagacggg 


ucugacaugg 


auugg acgaa 


npapf" rra a 4- +~ 
L. v-.cH- cy da u L. 


180 

X O v 


gccgcattgc 


agagat at tg 


uauuuaagug 


ccuagcucga 


taCaldddLy 


yy LtLOLuLy 


94 0 


gttagaccag 


atctgagcct 


gggagc tctc 


uggcuaacua 


gggaacccac 


4- /-f »— ■ 4- 4- a a f~f t~* f 

cycLtaayct 


00 


tcaataaagc 


ttgccttgag 


LgcttcaayL 


agugugugcc 


^ 4™ /^r 4— 

CyLCtytcyt 


guyacuctyg 


T fiO 


taactagaga 


tccctcagac 


ccuuuuaguc 


aguguggaaa 


a ^ +* a a 

otctcuayca 


guyycgcccg 


*± iS u 


aacagggact 


tgaaagcgaa 


agggaaacca 


gaggagcucu 


CuCyaCycay 


f~*z3 4" /-1 rt/~i ri 4- +• 

y Luyyu l. l. 


4fl0 


gcugaagcgc 


gcacggcaag 


aggcgagggg 


cggcgaccgg 


uy aycacgcc 


aaaaat"hM*n 
adaad ILL L.y 




actagcggag 


gctagaagga 


gagagatggg 


ugcgagagcg 


f-/ia rrt* a f" f a a 
LCagi-oLLaa 


frrTfnnrrff ana 

gegggggaga 


600 
out/ 


attagatcgc 


gaugggaaaa 


— ^ — i 4* ^ ri r** 4"" "4~ t» 

aauucgguua 


aggc c agggg 


Cf2 aa/^aaaa3 
yaaayaaadd 


a+* afaaahfa 
dUdladdULd 


660 


aaacauauag 


uaugggcaag 


cagggagcua 


gaacgau ucg 


i^a^T+"^aa+*/^/^ 

cayucddicc 


4-/-rr*ro/~'4~/^4~ 4- a 

uyyccuyL.ua. 


79 0 


gaaacatcag 


aaggctguag 


acaaauac ug 


ggacagcuac 


aaccatcctL 


ccayacayyd 


7 fin 


ucagaagaac 


uuagaucauu 


atacaa caca 


guaycadccc 


ucuaL-tguyu 


frr»a 4- /~» a a a rr/~r 


R40 


atagagataa 


aagacaccaa 


/«w>v«h q /™<c /■« 4* 4** 4* ^ 

ggaagccu.ua 


S /™i ^ a ^* ^ #™i 

CfdCaayatay 


a r*rrr a a /"r a <**r /*i a 

aggaay aycd 


aaaoaaaarrt - 
aadCadaay u 


Q00 


aagaccaccg 


cacagcaagc 


ggccgccgau 


CUUCagdCCl 


99 a 99 a 99 a 9 


at-af rra <f rrrT3 

auatyoyyya 




caattggaga 


agugaattat 


ataaatataa 


agtagtaaaa 


auugaaccau 


uaggaguagc 




acccaccaag 


gcaaagagaa 


gagtggtgca 


gagagaaaaa 


agagcagtgg 


gaataggagc 


1 n 0 n 


tttgttcctt 


gggttcttgg 


gagcagcagg 


aagcactatg 


ggcgcagcgt 


caatgacget 


114 U 


gacggtacag 


gccagacaat 


tattgtctgg 


tatagtgcag 


cagcagaaca 


atttgetgag 


1200 


ggctattgag 


gcgcaacagc 


atctgttgca 


actcacagtc 


fc 9999C a tca 


agcagctcca 


1260 


ggcaagaatc 


ctggctgtgg 


aaagatacct 


aaaggatcaa 


cagctcctgg 


ggatttgggg 


1320 


ttgctctgga 


aaactcattt 


gcaccactgc 


tgtgccttgg 


aatgctagtt 


ggagtaataa 


1380 


atctctggaa 


cagatttgga 


atcacacgac 


ctggatggag 


tgggacagag 


aaattaacaa 


1440 


ttacacaagc 


ttaatacact 


ccttaattga 


agaatcgcaa 


aaccagcaag 


aaaagaatga 


1500 


acaagaatta 


ttggaattag 


ataaatgggc 


aagtttgtgg 


aattggttta 


acataacaaa 


1560 
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ttggctgtgg 


tatataaaat 


tattcataat 


gatagtagga 


ggcttggtag 


gtttaagaat 


1620 


agtttttgct 


gtactttcta 


tagtgaatag 


agttaggcag 


ggatattcac 


cattatcgtt 


1680 


tcagacccac 


ctcccaaccc 


cgaggggacc 


cgacaggccc 


gaaggaatag 


aagaagaagg 


1740 


tggagagaga 


gacagagaca 


gatccattcg 


attagtgaac 


ggatctcgac 


ggtatcgata 


1800 


agcttgggag 


ttccgcgtta 


cataacttac 


ggtaaatggc 


ccgcctggct 


gaccgcccaa 


1860 


cgacccccgc 


ccattgacgt 


caataatgac 


gtatgttccc 


atagtaacgc 


caatagggac 


1920 


tttccattga 


cgtcaatggg 


tggagtattt 


acggtaaact 


gcccacttgg 


cagtacatca 


1980 


agtgtatcat 


atgccaagta 


cgccccctat 


tgacgtcaat 


gacggtaaat 


ggcccgcctg 


2040 


gcattatgcc 


cagtacatga 


ccttatggga 


ctttcctact 


tggcagtaca 


tctacgtatt 


2100 


agtcatcgct 


attaccatgg 


tgatgcggtt 


ttggcagtac 


atcaatgggc 


gtggatagcg 


2160 


gtttgactca 


cggggatttc 


caagtctcca 


ccccattgac 


gtcaatggga 


gtttgttttg 


2220 


gcaccaaaat 


caacgggact 


ttccaaaatg 


tcgtaacaac 


tccgccccat 


tgacgcaaat 


2280 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


tataagcaga 


gctcgtttag 


tgaaccgtca 


2340 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat 


agaagacacc 


gactctagag 


2400 


gatccactag 


tccagtgtgg 


tggaattgat 


cccttcacca 


agggctcgag 


tctagagggc 


2460 


ccgcggttcg 


aaggtaagcc 


tatccctaac 


cctctcctcg 


gtctcgattc 


tacgcgtacc 


2520 


ggttagtaat 


gagtttggaa 


ttaattctgt 


ggaatgtgtg 


tcagttaggg 


tgtggaaagt 


2580 


ccccaggctc 


cccaggcagg 


cagaagtatg 


caaagcatgc 


atctcaatta 


gtcagcaacc 


2640 


aggtgtggaa 


agtccccagg 


ctccccagca 


ggcagaagta 


tgcaaagcat 


gcatctcaat 


2700 


tagtcagcaa 


ccatagtccc 


gcccctaact 


ccgcccatcc 


cgcccctaac 


tccgcccagt 


2760 


tccgcccatt 


ctccgcccca 


tggctgacta 


atttttttta 


tttatgcaga 


ggccgaggcc 


2820 


gcctctgcct 


ctgagctatt 


ccagaagtag 


tgaggaggct 


tttttggagg 


cctaggcttt 


2880 


tgcaaaaagc 


tcccgggagc 


ttgtatatcc 


attttcggat 


ctgatcagca 


cgtgttgaca 


2940 


attaatcatc 


ggcatagtat 


atcggcatag 


tataatacga 


caaggtgagg 


aactaaacca 


3000 


tggccaagcc 


tttgtctcaa 


gaagaatcca 


ccctcattga 


aagagcaacg 


gctacaatca 


3060 


acagcatccc 


catctctgaa 


gactacagcg 


tcgccagcgc 


agctctctct 


agcgacggcc 


3120 


gcatcttcac 


tggtgtcaat 


gtatatcatt 


ttactggggg 


accttgtgca 


gaactcgtgg 


3180 


tgctgggcac 


tgctgctgct 


gcggcagctg 


gcaacctgac 


ttgtatcgtc 


gcgatcggaa 


3240 


atgagaacag 


gggcatcttg 


agcccctgcg 


gacggtgccg 


acaggtgctt 


ctcgatctgc 


3300 


atcctgggat 


caaagccata 


gtgaaggaca 


gtgatggaca 


gccgacggca 


gttgggattc 


3360 


gtgaattgct 


gccctctggt 


tatgtgtggg 


agggctaagc 


acaattcgag 


ctcggtacct 


3420 
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ttaagaccaa tgacttacaa ggcagctgta gatcttagcc actttttaaa agaaaagggg 3480 

ggactggaag ggctaattca ctcccaacga agacaagatc tgctttttgc ttgtactggg 3540 

tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 3600 

cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 3660 

gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt 3720 

agtagttcat gtcatcttat tattcagtat ttataacttg caaagaaatg aatatcagag 3780 

agtgagagga acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca 3840 

aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc 3900 

aatgtatctt atcatgtctg gctctagcta tcccgcccct aactccgccc atcccgcccc 3960 

taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg 4020 

cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg 4080 

gaggcctagg gacgtaccca attcgcccta tagtgagtcg tattacgcgc gctcactggc 4140 

cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 4200 

agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc 4260 

ccaacagttg cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc 4320 

ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 43 80 

tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 4440 

aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 4500 

acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 4560 

tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 4620 

caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg 4680 

gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgct 4740 

tacaatttag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 4800 

taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 4860 

tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 4920 

gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 4980 

gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 5 040 

cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 5100 

tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac 5160 

tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 5220 
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atgacagtaa 


gagaattatg 


cagtgctgcc 


ataaccatga 


gtgataacac tgcggccaac 


5280 


ttacttctga 


caacgatcgg 


aggaccgaag 


gagctaaccg 


cttttttgca caacatgggg 


5340 


gatcatgtaa 


ctcgccttga 


tcgttgggaa 


ccggagctga 


atgaagccat accaaacgac 


5400 


gagcgtgaca 


ccacgatgcc 


tgtagcaatg 


gcaacaacgt 


tgcgcaaact attaactggc 


5460 


gaactactta 


ctctagcttc 


ccggcaacaa 


ttaatagact 


ggatggaggc ggataaagtt 


5520 


gcaggaccac 


ttctgcgctc 


ggcccttccg 


gctggctggt 


ttattgctga taaatctgga 


5580 


gccggtgagc 


gtgggtctcg 


cggtatcatt 


gcagcactgg 


ggccagatgg taagccctcc 


5640 


cgtatcgtag 


ttatctacac 


gacggggagt 


caggcaacta 


tggatgaacg aaatagacag 


5700 


atcgctgaga 


taggtgcctc 


actgattaag 


cattggtaac 


tgtcagacca agtttactca 


5760 


tatatacttt 


agattgattt 


aaaacttcat 


ttttaattta 


aaaggatcta ggtgaagatc 


5820 


ctttttgata 


atctcatgac 


caaaatccct 


taacgtgagt 


tttcgttcca ctgagcgtca 


5880 


gaccccgtag 


aaaagatcaa 


aggatcttct 


tgagatcctt 


tttttctgcg cgtaatctgc 


5940 


tgcttgcaaa 


caaaaaaacc 


accgctacca 


gcggtggttt 


gtttgccgga tcaagagcta 


6000 


ccaactcttt 


ttccgaaggt 


aactggcttc 


agcagagcgc 


agataccaaa tactgttctt 


6060 


ctagtgtagc 


cgtagttagg 


ccaccacttc 


aagaactctg 


tagcaccgcc tacatacctc 


6120 


gctctgctaa 


tcctgttacc 


agtggctgct 


gccagtggcg 


ataagtcgtg tcttaccggg 


6180 


ttggactcaa 


gacgatagtt 


accggataag 


gcgcagcggt 


cgggctgaac ggggggttcg 


6240 


tgcacacagc 


ccagcttgga 


gcgaacgacc 


tacaccgaac 


tgagatacct acagcgtgag 


6300 


ctatgagaaa 


gcgccacgct 


tcccgaaggg 


agaaaggcgg 


acaggtatcc ggtaagcggc 


6360 


agggtcggaa 


caggagagcg 


cacgagggag 


cttccagggg 


gaaacgcctg gtatctttat 


6420 


agtcctgtcg 


ggtttcgcca 


cctctgactt 


gagcgtcgat 


ttttgtgatg ctcgtcaggg 


6480 


gggcggagcc 


tatggaaaaa 


cgccagcaac 


gcggcctttt 


tacggttcct ggccttttgc 


6540 


tggccttttg 


ctcacatgtt 


ctttcctgcg 


ttatcccctg 


attctgtgga taaccgtatt 


6600 


accgcctttg 


agtgagctga 


taccgctcgc 


cgcagccgaa 


cgaccgagcg cagcgagtca 


6660 


gtgagcgagg 


aagcggaaga 


gcgcccaata 


cgcaaaccgc 


ctctccccgc gcgttggccg 


6720 


attcattaat 


gcagctggca 


cgacaggttt 


cccgactgga 


aagcgggcag tgagcgcaac 


6780 


gcaattaatg 


tgagttagct 


cactcattag 


gcaccccagg 


ctttacactt tatgcttccg 


6840 


gctcgtatgt 


tcrtcrtcrciaafc 


tcrtcracrcciaa 


taacaatttc 


acacaocraaa raarhahoar 


G900 

O 7 U *J 


catgattacg 


ccaagcgcgc 


aattaaccct 


cactaaaggg 


aacaaaagct ggagctgcaa 


6960 



gctt 6964 



13/49 

<210> 4 
<211> 8634 
<212> DNA 
<213> Artificial 



<220> 

<223> pLenti4/V5-DEST 
<400> 4 



aafccrfcacrtpfc 

no ^y t»Cty U\>L> 


fcafcerpaafcac 


tcttataatc 


i^yuctctuctuy 


erfcaapcrafcera crfcfcacrpaapa 
y u ctctuy ct y ct y l» uay ctctu ct 


60 


fcac c fc fc a p a a 
i_ uctu ctct 


erera era era a a a 
yy cty cty ctct act 


acrpappcrfcerp 

cty t»y 


a fcerppera fc fcer 


erl - rrpa a er fc a a errrfc ererfc a pera 
y uyy ctcty uctct yy uy y uctuyct 




fccerfcanpfc fc a 


fc fc a or era a crcrp 
u udyyctayy u 


a a p a era p erercr 
cictuciyctuyyy 


fc p fc era p a fc errr 


afcfcperapfraa <~"Papfceraafcfc 
ct u uy y ctuy ctct uuctuuy ctctu u 


low 


cr cccrcafc tor 


aeracrafcafcfcef 
cty cty cl u ct u uy 


fcafc fc.t~aaeifc.Cf 
lciu w uctcty uy 


ppfcacrpfcpcra 
uu ucty u v-«y ct 


fcapafcaaapcr cicjfcpfcpfcpfca 
uctu ct u ctct ctuy yyuuuui~uuy 


240 


attaaaccaa 


at cfccracrcpfc 


crcrcfaerpfc pfcp 
y y y o.y wll>uw 


fc acre fc a ac fc. a 
uyy l> uctm* uct 


333 y ^cicty uu 


3 00 


fccaataaaap 


fcfccrnpfc taacr 


fc crpfc tc a aot 


aerfcerfcafcerec 

cty t»y uy L.y ^.u 


pcrfc pfccrfc fcerfc. afccrapfccfccrcr 
uyuuuyuuyu yuyctuuut^yy 


360 


fcaac fcagaga 


fccccfccacrap 


crfcfc ttacrtc 


aafcafcercra a a 
c*y >»*y »»yycictct 


afccfccfcacrca cr fcercrpoppper 


420 


aa c agg ga c t 


taaaaacoaa 

dwl Cl^n wM CLM 


aacroaaa pea 


era crcracict nfc 


cfcpcraccrpaci cracfcpcrcrpfcfc 


480 


crpfccraacrpc/p 


ctp a pcrcrpa a ei 
y uctu yyuct cty 


a acrPciacrcTcrcT 
o-yyw-yciyyyy 


pcfcrpcrapfccrcr 
uy y <- y y 


fcnaoffca r k nr*n aaaaafcfcfcfccr 
uyctyuciuyuu ctct ctctct u u l. uy 




Ctt» t~Cty wVjVjCiy 


crpfc aaa acrcra 
y u ucty ctctyy ct 


era era ci a fc ncra 
3«yciycityyy 


fc opetanaerprr 
t- y y cty cty u y 


fcpaerfcafcfcaa erpeiererereraera 
uuety uctu u ctct yuyyyyyctyct 


O V w 


ct u uay atLy^ 


era fc erer era a a a 
y ct u y y y ctctct ci 


a a fc fc prrrrfc fc a 
act u UL.yy Lua 


aggecagggg 


naaarraaaaa a^afcaaa+*fca 
y dddy dddclcl dUdUdddUUd 


DOU 


ct ct d u ct u ct u cty 


fc a fc nrrerpa a rr 
uctuyyyuddy 


/-i a <-r erera <tp fc a 

^ »y y y «.y c u d 


y ctctu yet u uuy 


udyuudduuu uyyuuuyuud 


/aw 


era aafafpafl 

y eta ct i ct UuCiH 


a aerfrr«fcf7fc a rr 
d»yy u> *-y uciy 


ctuactctudu uy 


y y at cty u ct u 


dduuduuuuu uudydudyyd 


•7 fl n 


tcagaagaac 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt gcatcaaagg 


840 


atagagataa 


aagacaccaa 


ggaagcttta 


gacaagatag 


aggaagagca aaacaaaagt 


900 


aagaccaccg 


cacagcaagc 


ggcegctgat 


cttcagacct 


ggaggaggag atatgaggga 


960 


caattggaga 


agtgaattat 


ataaatataa 


agtagtaaaa 


attgaaccat taggagtagc 


1020 


acccaccaag 


gcaaagagaa 


gagtggtgca 


gagagaaaaa 


agagcagtgg gaataggagc 


1080 


tttgttcctt 


gggttcttgg 


gagcagcagg 


aagcactatg 


ggcgcagcgt caatgacget 


1140 


gaeggtacag 


gecagacaat 


tattgtctgg 


tatagtgcag 


cagcagaaca atttgetgag 


1200 


ggctattgag 


gcgcaacagc 


atctgttgca 


actcacagtc 


tggggcatca agcagctcca 


1260 


ggcaagaatc 


ctggctgtgg 


aaagatacct 


aaaggatcaa 


cagctcctgg ggatttgggg 


1320 


ttgctctgga 


aaactcattt 


gcaccactgc 


tgtgccttgg 


aatgctagtt ggagtaataa 


1380 
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atctctggaa 


cagatttgga 


atcacacgac 


ctggatggag tgggacagag 


aaattaacaa 


1440 


ttacacaagc 


ttaatacact 


ccttaattga 


agaatcgcaa aaccagcaag 


aaaagaatga ' 


1500 


acaagaatta 


ttggaattag 


ataaatgggc 


aagtttgtgg aattggttta 


acataacaaa 


1560 


ttggctgtgg 


tatataaaat 


tattcataat 


gatagtagga ggcttggtag 


gtttaagaat 


1620 


agtttttgct 


gtactttcta 


tagtgaatag 


agttaggcag ggatattcac 


cattatcgtt 


1680 


tcagacccac 


ctcccaaccc 


cgaggggacc 


cgacaggccc gaaggaatag 


aagaagaagg 


1740 


tggagagaga 


gacagagaca 


gatccattcg 


attagtgaac ggatctcgac 


ggtatcgata 


1800 


agcttgggag 


ttccgcgtta 


cataacttac 


ggtaaatggc ccgcctggct 


gaccgcccaa 


1860 


cgacccccgc 


ccattgacgt 


caataatgac 


gtatgttccc atagtaacgc 


caatagggac 


1920 


tttccattga 


cgtcaatggg 


tggagtattt 


acggtaaact gcccactt^g 


cagtacatca 


1980 


agtgtatcat 


atgccaagta 


cgccccctat 


tgacgtcaat gacggtaaat 


ggcccgcctg 


2040 


gcattatgcc 


cagtacatga 


ccttatggga 


ctttcctact tggcagtaca 


tctacgtatt 


2100 


agtcatcgct 


attaccatgg 


tgatgcggtt 


ttggcagtac atcaatgggc 


gtggatagcg 


2160 


gtttgactca 


cggggatttc 


caagtctcca 


ccccattgac gtcaatggga 


gtttgttttg 


2220 


gcaccaaaat 


caacgggact 


ttccaaaatg 


tcgtaacaac tccgccccat 


tgacgcaaat 


2280 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


tataagcaga gctcgtttag 


tgaaccgtca 


2340 


gatcgcctgg 


agacgccatc 


cacgctgttt 


tgacctccat agaagacacc 


gactctagag 


2400 


gatccactag 


tccagtgtgg 


tggaattctg 


cagatatcaa caagtttgta 


caaaaaagct 


2460 


gaacgagaaa 


cgtaaaatga 


tataaatatc 


aatatattaa attagatttt 


gcataaaaaa 


2520 


cagactacat 


aatactgtaa 


aacacaacat 


atccagtcac tatggcggcc 


gcattaggca 


2580 


ccccaggctt 


tacactttat 


gcttccggct 


cgtataatgt gtggattttg 


agttaggatc 


2640 


cggcgagatt 


ttcaggagct 


aaggaagcta 


aaatggagaa aaaaatcact 


ggatatacca 


2700 


ccgttgatat 


atcccaatgg 


catcgtaaag 


aacattttga ggcatttcag 


tcagttgctc 


2760 


aatgtaccta 


taaccagacc 


gttcagctgg 


atattacggc ctttttaaag 


accgtaaaga 


2820 


aaaataagca 


caagttttat 


ccggccttta 


ttcacattct tgcccgcctg 


atgaatgctc 


2 880 


atccggaatt 


ccgtatggca 


atgaaagacg 


gtgagctggt gatatgggat 


agtgttcacc 


2940 


cttgttacac 


cgttttccat 


gagcaaactg 


aaacgttttc atcgctctgg 


agtgaatacc 


3000 


acgacgattt 


ccggcagttt 


ctacacatat 


attcgcaaga tgtggcgtgt 


tacggtgaaa 


3060 


acctggccta 


tttccctaaa 


gggtttattg 


agaatatgtt tttcgtctca 


gccaatccct 


3120 


gggtgagttt 


caccagtttt 


gatttaaacg 


tggccaatat ggacaacttc 


ttcgcccccg 


3180 


ttttcaccat 


gggcaaatat 


tatacgcaag 


gcgacaaggt gctgatgccg 


ctggcgattc 


3240 



* 
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—j r*r**ri- 4- a 4- /-» a 

aygtccatCtt 


4- pt p« p» rr t* p +" n t~ 
tyccy tt uy u 


era t~ crap t" t~ p p 
yaLyyi~uuu>u. 


atcftcacrcaa 


aatgcttaat 


gaattacaac 


3300 


aguacugcga 


ugaguygcay 


cr pt p cictnctc Pit" 

ggcgyyy^y 


3aaaai*ctaa 


ateeggctta 


etaaaageca 


3360 


gataacagfca 


4-p at/""* /T+- aH" t~PY 

ugcgt.ac.uuy 


p»pfp<PYP't"Pf a t* t~ 
ty ty t uy a l l 


t- 1* t" cr c* aa fc a t 
ULLytyyu.au- 


aagaatatat 


actcratatQt 


3420 


atacccgaag 


uaugucaaaa 


agaggugugc 


faf" era a fr/"* a tt 
ua uy day tay 


cctattacao 


taacaattcra 


3480 


cagcgacagc 


tatCoyttgC 


tCddyyCdtd 


u a ty a uy u t a 


a t" afcc t cecra 


tetaataaQC 


3540 


acadccatgc 


a pra a t* pt a a ptp - " 

ayddtyddyc 


tty ttyutuy 


p cr t ei c* pera a p 
ty uy ttyaat 


aefccraaaacfc 

y u» uy w 


acraaaatcaa 


3600 


gaagggaugg 


na ptpt t* rrtr* 
t uy ay y u uy t 


t ty y luuquu 


era a at" era a per 


actettttcrc 

y w C w W w wy \* 


taacaacraac 

»y wy «*y **^*>^ 


3660 


ay y y at u y y u 


na a a t~ cr parrf* 
yaaatytay u 




pacefcafcaaa 


acracracracrc c 


attatcatct 

w U-* t+ W "**^ "*"*J fc^ V w 


3720 


/-rf- 4- 4- /— r+- f~f/-r a 4— 

ytttgtgydt 


/T't~aP , aPTaPYt"PT 

ytdcdydy uy 


a"Hat"t"at*f* era 


tau.yu.tu.yyy 


ccracoaatCfQ 


tgatccccct 


3780 


ggccagug ca 


tguc uy t uy u 


pana t" a a ant" 
tay a uaaay u 


pt* pp pert* era a 
t uttty uy aa 


ett* t"accpcro 

\w l u u civ-, ^.uyy 


taofcanatat 

y y »*y w v* ^ \~ 


3840 


cggggaugaa 


a pi p* +- «o rrfpa 

agcuggcgca 


tyatyattdt 


rrra tat* nnr n 
tyauaLyytt 


ay uy uy t ty y 


tctecattat 


3900 


cggyyaay aa 


/~» 4- rrt~i p» t* pt a t~ « 

y uy y u uy a uu 


Ltay t tatty 


pera a a a t*era p 
ty aaaa u^y au. 


atcaaaaaca 


ceat taaccfc 

"W- W UA U- W t*. (A W W W 


3960 




prpfa a +* a t~ a a a 
y y aaLaLdaci 


uy u t ay y t ll. 


pert" tahacar 
ty LLaLau.at 


aaeeacftetcr 


caacrtcaacc 


4020 


a t~ a frt" rta f* 1 4- pr 
d Lay LyaL uy 


fta f*a f~nf* t~ pt t* 
yauauy l ty l 


yLLL tatay i— 


a t tatcrtacrt 

cl l La uy u ay u 


ctatttttta 


tgeaaaatet 


4080 


aaL.ULaaL.at 


_ i_ 4- a 4-4- 4- 

dttyatdttt 


a+-ahraht*h t 
auautauuuu 


a pert* "h t* c fc. per 
aty l l t w l ty 


ti" raac tt* te 


tt*atacaaaa 


4140 


+"<tpt+~ fn2 hat* 

ug y u tyd lql 


ttaytatay u 


yy uyy tty t u 


pcracrt" p t a era 
t y »y t» u- u ay a 


craaeccaccra 
yyy ttty tyy 


ttccraaacita 


4200 


aycctatccc 


tdatttLL.Lt 


tuuyyuuuty 


a t* t" pt* a pcrpcr 
auLtLa^yty 


hacraattaa 

uaiwoy y tay 


taataaattt 


4260 


yyaat u dat l 


pt" > p/t~PTPfa afrf 
u uy uyy cxctuy 


i*rri~ erf" pant" t" 
t.y i»y u t ay i» l 


a ercrcrt" eft* crcr a 
ayyy uy uyy a 


aaatccccaa 


crctccccaacr 

y v v.. v». v» w y y 


4320 


tdyytdyddy 


tatytaaay t 


auytaututa 


a ttacft*cacfc 
cl l way ''^"j *^ 


aaccaaatcrt 


crcr aa aa fc c c c 

*n vA UA UA ^ ~w w 


4380 


<~i a ptptp' i~ p p 1 p» p» 

cdyyttcttt 


ay tayy tay a 


a rrt" a i - erp a a a 
ay Lauyuacia 


crcatcrcatct 


caattacrtca 


acaaccataa 

7*1 W UA UA "U^ N-r- fcA W V^**— y 


4440 


t"p*P"P"Pfp*PP'p , t* 
ttttytttt u 


aat Ltty tt t 


at>L.ttytttt 


t-aact decree 

L. W* L> \* ^ w w 


caattccacc 


cattctcccrc 


4500 


p« fi 3 4* /~rrf(^ f* pr 

tcudtyycty 


apt-aat* t-t-t-i - 
at uaa l. u u u u 


UL,iauuiftuy 


pacracrcreecra 
^°y Q yy uu y a 


acrccacctcfc 


acctetcraac 


4560 


tdttCCayda 


prt~ arr+Tiannfl 

y u ay uy ay y a 


yyttLLLLuy 


er a erer p p t" a er cr 
y ayy tt uay y 


wu u u uy t ciara 


aaapt" ccccc 


4620 


+-/~rf- f- rra p* a at* 
uy u tyataat 


uaautautyy 


ni~ at" at* 
ta^ay ta ua w 


p crap at" acrt" a 

U<~j^j U.CL w ^ j 


taa r*araaca 


aaataaaaaa 

t*y y uy c*y y "*» 


4680 


CtadatuatLj 


rrpfaa frtr t" na 
y ttaay u uy a 


t tay ty tty u 


t" pprrrrt-er pt~ P 
uttyy uytut 


appftpnpnpfr 
atty ty ty ty 


a pat* pciccaa 
a^y utyu.uyy 


4740 


aPTP»PTPT*~ rTfarr 

agcggucyay 


t" **■ p*t~pfprap , p , pf 
u ut uy yatty 


attyy t utyy 


rrt" t" p1~ P P pan 
y u uu uttty y 


aa pf"t"prjt~crrT 
y cat l uty uyy 


aacraraacth 


4800 


cgccgguy ug 


p» t~ « fif-YPTPra 
y ut ty y y aty 


aty uy au tt u 


ert" t*pat*Pap;p 
yuutautayt 


rrpnnt'pparfn 
y tyy ut tay y 


a p pa erer t*CTert" 
at uay y uyy u 


4860 


gccggacaac 


accctggcct 


gggtgtgggt 


gcgcggcctg 


gacgagctgt 


aegecgagtg 


4920 


gtcggaggtc 


gtgtccacga 


acttceggga 


cgcctccggg 


ccggccatga 


ccgagategg 


4980 


cgagcagccg 


tgggggcggg 


agttcgccct 


gcgcgacccg 


gccggcaact 


gcgtgcactt 


5040 



16/49 

cgtggccgag gagcaggact gacacgtgct acgagattta aatggtacct ttaagaccaa 5100 

tgacttacaa ggcagctgta gatcttagcc actttttaaa agaaaagggg ggactggaag 5160 

ggctaattca ctcccaacga agacaagatc tgctttttgc ttgtactggg tctctctggt 5220 

tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc 52 80 

aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta 5340 

actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt agtagttcat 5400 

gtcatcttat tattcagtat ttataacttg caaagaaatg aatatcagag agtgagagga 5460 

acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca aatttcacaa 5520 

ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc aatgtatctt 5580 

atcatgtctg gctctagcta tcccgcccct aactccgccc atcccgcccc taactccgcc 5640 

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 5700 

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 5760 

gacgtaccca attcgcccta tagtgagtcg tattacgcgc gctcactggc cgtcgtttta 5820 

caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc 5880 

cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg 5940 

cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg 6000 

gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct 6060 

ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg 6120 

ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag 6180 

ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg 6240 

gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc 6300 

tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat 6360 

gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgct tacaatttag 6420 

gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt 6480 

caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa 6540 

ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt 6600 

gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt 6660 

tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt 672 0 

ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg 6780 

tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga 6840 

atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa 6900 
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gagaattatg 


cagtgctgcc 


ataaccatga 


gtgacaacac 


ugcggccaac 




D?OU 


caacgatcgg 




gagctaaccg 


ettutctgea 


caacatgggg 


^r"> 4-» ^3 4— f— a a 

gauca uy caa 


f \J £>\J 


ctcgccttga 


tcgttgggaa 


ccggagctga 


atgaagecat 


accaaacgac 


gagegegaca 


/UOU 


ccacgatgcc 


tgtagcaatg 


geaacaaegfc 


tgegcaaact 


attaactggc 


gaaccaccca 


/ -L4k V 


ctctagcttc 


ccggcaacaa 


ttaatagact 


ggatggaggc 


ggataaagtt 


gcaggaccac 


/ZUv 


ttctgcgctc 


ggcccttccg 


gctggctggt 


ttactgc tga 


taaatcfcgga 


geeggtgage 


/Z O U 


gtgggtctcg 


cggtatcatt 


gcagcaccgg 


ggccagatgg 


caagcccucc 


cgra tcg^ag 


*7 1 o n 


ttatctacac 


gacggggagt 


caggcaacta 


tggatgaacg 


aaatagacag 


ategctgaga 


73oU 


taggtgcctc 


actgattaag 


cattggtaac 


tgtcagacca 


agfcttactca 


tatatacttc 


/44 0 


agattgattt 


aaaacttcat 


tcttaattta 


aaaggatcta 


ggtgaagatc 


ctctcngaca 


7500 


atctcatgac 


caaaatccct 


taacgtgagt 


tttcgttcca 


ctgagegtea 


gaccccgtag 


Teen 


aaaagatcaa 


aggatct tec 


tgagatcctt 


4-4-4-4-4- /~ — — 

ttuttcrgcg 


cgtaatctgc 


tgcfctgcaaa 




caaaaaaacc 


accgctacca 


gcggtggttt 


gtttgccgga 


tcaagagcta 


«™s — * j-i 4" ^ ^ 4* 

ccaactcuct 


/OCJU 


tfcccgaaggt 


aactggcttc 


ageagagege 


agataccaaa 


uactgnccct 


ctagtgtagc 


7740 


cgfcagtfcagg 


ccaccacttc 


aagaactctg 


tagcaccgcc 


tacataccfcc 


getctgetaa 


*7 □ n a 


tcccgccacc 


agcggccgcc 


gccagtggcg 


acaagucgeg 


ccccaccggg 


ccggacucaa 


/ OOU 


gacgauagct 


aceggauaag 


gcgcagcggu 


egggcugaac 




ugcacacagc 


A 

/ u 


ccagcc cgga 


gcgaacgacc 


cacaccgaac 


tigagataccc 


ac ageg ugag 


ccacgagaaa 




gcgccacgct 


tcccgaaggg 


agaaaggegg 


acaggtatcc 


ggtaagegge 


agggteggaa 


o r\ A A 


caggagagcg 


c acgaggg ag 


cttccagggg 


gaaacgcctg 


gtatctttat 


agtcctgtcg 


8100 


ggcu ccgcca 


cctctgactfc 


gagegtcgat 


tuccgegaug 


ctegtcaggg 


gg9cggagcc 


O 1 £. A 


cacggaaaaa 


cgccagcaac 


gcggccuc u u 


caegge cccc 


ggccttttgc 


eggecuut eg 


O O O A 


cccacaccfut 


^ ^ 4* 4* ^ y"^r jpn* 

cc tccccgcg 


ttatcccctg 


attctgtgga 


taacegtatt 


accgcctttg 


O O O A 


agtgagctga 


taccgctcgc 


cgcagccgaa 


cgaccgagcg 


cagegagtea 


gtgagegagg 


8340 


aagcggaaga 


gcgcccaata 


cgcaaaccgc 


ctctccccgc 


gcgttggccg 


attcattaat 


8400 


gcagctggca 


cgacaggttt 


cccgactgga 


aagcgggcag 


tgagegcaac 


gcaattaatg 


O A /" A 

8460 


tgagttagct 


cactcattag 


gcaccccagg 


ctttacactt 


tatgettccg 


getegtatgt 


8520 


tgtgtggaat 


tgtgagcgga 


taacaatttc 


acacaggaaa 


cagctatgac 


catgattacg 


8580 


ccaagcgcgc 


aattaaccct 


cactaaaggg 


aacaaaagct 


ggagctgcaa 


gctt 


8634 
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<210> 5 
<211> 9320 
<212> DNA 
<213> Artificial 



<220> 

<223> pLenti6/UbC/V5-DEST 
<400> 5 



aatgtagtct 


tatgcaatac 


ccccgcagcc ccgcaacacg 


gtaacgatgd 


/*»+" 4-a/""t/"*aaf.a 
gc Cdy CddCd 


D U 


tgccttacaa 


ggagagaaaa 


agcaccgcgc acgccgaccg 


rr4~ t~Tf~l 3 a rrt- a a 

guggaagtaa 


ggcyy cacga 


ion 


cCgcyCCcca 


c c ayy aag gc 


a armaria ri/^nrr t* f t* fra f A t" rtn 


aff cicia r»r*ra a 
dc cy y di»y ao. 




180 


yccgcatugc 




4- a 4- 4r 4» a a rrt" rr r" i~ anrt* pna 
ttttttaoyt.y Lttayutuya 


t"apat"aaapfl 




240 


gc cagaccag 


?s 4- /-» 4- rta rrn #"i 4~ 

acccgagccc 


n/trt3rrr'f'ol"ri 4- nnp 4/" a a f 4~ a 

gygayccccc cyycuaaULo 


yyy ddcccdc 




•J V V 


ccaacaaagc 


ccgcctcgag 


H«/^t*f roanf" =jrT**<nr4-rrf"<T/^»/*' 
tyCLLCaay l a 3 l *3 , -9 l -y t ' t ' 


cy LLtyL cy c 


y uyaL l. v-r uy y 


360 

J u u 


taactagaga 


t ccc t cagac 


/-t 4- 4- 4- 4~ n 4" arf4*rtf , rtfta a a 

CCCLtLdy LC ayuytygaaa 


a 4- /-i 4- /"■« 4- a /-tp a 
dCCCCCdyCd 


y cyy cy cccy 


490 


aacagggacc 


cgaaagcgaa 


aggyadacca gayyayctct 


cucy dcy ccty 




tow 


g ccgaag eg c 


gcacggcaag 


aggcgagggg cggcgactgg 


L.y cty ucicy cc 


aaaaahfhhff 
ddddd u u u uy 


R4. 0 
j *± >j 


accagcggag 


n 4- arta a fffrss 

yctagaoyga 


gagayacgyy cgcgayaycy 




ycyyyyydya 


600 


at tag accgc 


ga tgggaaaa 


aatccgguta ayyccayyyy 


rra aa/*raaaoa 
y ddcLy ddddd 


afafaaahfa 
dCdCdddCCd 


D O U 


aaacatatag 


tatgggcaag 


cagggagcta gaacgattcg 


cage t a a ccc 


4- *~*r* nnf 1 « 4" 4" 

cyy Cc cy c ca 


/ 6 V 


gaaacatcag 


aaggctgtag 


acaaatactg ggacagctac 


aaccatccct 


tcagacagga 


too 


tcagaagaac 


ttagatcatt 


atataataca gtagcaaccc 


tctattgtgt 


gcatcaaagg 


840 


atagagataa 


aagacaccaa 


ggaagcttta gacaagatag 


aggaagagca 


aaacaaaagt 


900 


aagaccaccg 


cacagcaagc 


ggccgctgat cttcagacct 


ggaggaggag 


atatgaggga 


960 


caattggaga 


agtgaattat 


ataaatataa agtagtaaaa 


attgaaccat 


taggagtagc 


1020 


acccaccaag 


gcaaagagaa 


gagtggtgca gagagaaaaa 


agagcagtgg 


gaataggagc 


1080 


tttgttcctt 


gggttcttgg 


gagcagcagg aagcactatg 


ggcgcagcgt 


caatgacget 


1140 


gacggtacag 


gccagacaat 


tattgtctgg tatagtgcag 


cagcagaaca 


atttgetgag 


1200 


ggctattgag 


gcgcaacagc 


atctgttgca actcacagtc 


tggggcatca 


agcagctcca 


1260 


ggcaagaatc 


ctggctgtgg 


aaagatacct aaaggatcaa 


cagctcctgg 


ggatttgggg 


1320 


ttgctctgga 


aaactcattt 


gcaccactgc tgtgccttgg 


aatgctagtt 


ggagtaataa 


1380 
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atctctggaa 


cagatttgga 


atcacacgac 


ctggatggag 


tgggacagag 


aaattaacaa 


1440 


ttacacaagc 


ttaatacact 


ccttaattga 


agaatcgcaa 


aaccagcaag 


aaaagaatga 


1500 


acaagaatta 


ttggaattag 


ataaatgggc 


aagtttgtgg 


aattggttta 


acataacaaa 


1560 


ttggctgtgg 


tatataaaat 


tattcataat 


gatagtagga 


ggcttggtag 


gtttaagaat 


1620 


agtttttgct 


gtactttcta 


tagtgaatag 


agttaggcag 


ggatattcac 


cattatcgtt 


1680 


tcagacccac 


ctcccaaccc, 


cgaggggacc 


cgacaggccc 


gaaggaatag 


aagaagaagg 


1740 


tggagagaga 


gacagagaca 


gatccattcg 


attagtgaac 


ggatctcgac 


ggtatcggat 


1800 


ctggcctccg 


cgccgggttt 


tggcgcctcc 


cgcgggcgcc 


cccctcctca 


cggcgagcgc 


1860 


tgccacgtca 


gacgaagggc 


gcaggagcgt 


cctgatcctt 


ccgcccggac 


gctcaggaca 


1920 


gcggcccgct 


gctcataaga 


ctcggcctta 


gaaccccagt 


atcagcagaa 


ggacatttta 


1980 


ggacgggact 


tgggtgactc 


tagggcactg 


gttttctttc 


cagagagcgg 


aacaggcgag 


2040 


gaaaagtagt 


cccttctcgg 


cgattctgcg 


gagggatctc 


cgtggggcgg 


tgaacgccga 


2100 


tgattatata 


aggacgcgcc 


gggtgtggca 


cagctagttc 


cgtcgcagcc 


gggatttggg 


2160 


tcgcggttct 


tgtttgtgga 


tcgctgtgat 


cgtcacttgg 


tgagtagcgg 


gctgctgggc 


2220 


tggccggggc 


tttcgtggcc 


gccgggccgc 


tcggtgggac 


ggaagcgtgt 


ggagagaccg 


2280 


ccaagggctg 


tagtctgggt 


ccgcgagcaa 


ggttgccctg 


aactgggggt 


tggggggagc 


2340 


gcagcaaaat 


ggcggctgtt 


cccgagtctt 


gaatggaaga 


cgcttgtgag 


gcgggctgtg 


2400 


aggtcgttga 


aacaaggtgg 


ggggcatggt 


gggcggcaag 


aacccaaggt 


cttgaggcct 


2460 


tcgctaatgc 


gggaaagctc 


ttattcgggt 


gagatgggct 


ggggcaccat 


ctggggaccc 


2520 


tgacgtgaag 


tttgtcactg 


actggagaac 


tcggtttgtc 


gtctgttgcg 


ggggcggcag 


2580 


ttatgcggtg 


ccgttgggca 


gtgcacccgt 


acctttggga 


gcgcgcgccc 


tcgtcgtgtc 


2640 


gtgacgtcac 


ccgttctgtt 


ggcttataat 


gcagggtggg 


gccacctgcc 


ggtaggtgtg 


2700 


cggtaggctt 


ttctccgtcg 


caggacgcag 


ggttcgggcc 


tagggtaggc 


tctcctgaat 


2760 


cgacaggcgc 


cggacctctg 


gtgaggggag 


ggataagtga 


ggcgtcagtt 


tctttggtcg 


2820 


gttttatgta 


cctatcttct 


taagtagctg 


aagctccggt 


tttgaactat 


gcgctcgggg 


2880 


ttggcgagtg 


tgttttgtga 


agttttttag 


gcaccttttg 


aaatgtaatc 


atttgggtca 


2940 


atatgtaatt 


ttcagtgtta 


gactagtaaa 


ttgtccgcta 


aattctggcc 


gtttttggct 


3000 


tttttgttag 


acgaagcttg 


gtaccgagct 


cggatccact 


agtccagtgt 


ggtggaattc 


3060 


tgcagatatc 


aacaagtttg 


tacaaaaaag 


ctgaacgaga 


aacgtaaaat 


gatataaata 


3120 


fraatatatt 


aaattaaatt 


ttocataaaa 


aaraaactac 


ataatactcrt 


aaaacacaac 


3180 


atatccagtc 


actatggcgg 


ccgcattagg 


caccccaggc 


tttacacttt 


atgcttccgg 


3240 
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r* 4" c ci\~ a +* a a t" 
LUty La ucici u 


cr t" cr t* rrrra +- t* t* 
y uy L.yycit.t.u 


t~ aacrt t acr era 

uy ciy w w ciy y wv 


fc c caaccraaa 


ttttcaggag 


ctaaggaagc 


3300 


t* a a a a fc rtCf a er 


a a a a a a a t* ca 
ncicici cLcicL uuo 


ctgga tatac 


caccgtfcgat 


atatcccaat 


ggcatcgtaa 


3360 


agaaLal 1.11 


crarrnca c 
y ciyy wet u ull 


a at* cacrt fcerc 

ciy w wdy L, uy w 


fceaatatace 


tataaccaga 


ccgttcagct f 


3420 




yULLlL. w wCLCL 


a craccataaa 

ciy c* w wy w nnci 




cacaagtttt 


atccggcctt 


3480 


4* a •+* t~ r* a ca fc fc 


w w uy^^V/yuv 


t" aafcetaaterc 


tcatccaaaa 


ttccatataa 


caatgaaaga 


3540 


c rT cr f~ cr a cr c t~ a 

^y y wy o.y ^- ^-y 


cr t cr a fc a fc crcrcr 
y vy ciua i»y yy 


a fc aatat tea 


cccttgttac 


accgttttcc 


atgagcaaac 


3600 


+-aaa a rafc fc fc 


tcatcactrt 


aaacr tctaata 

3 3 3 w j w>fc* w w» 


ccacgacgat 


ttccggcagt 


ttctacacat 


3660 


a fc a fc fc ncrr'a a 


cr a t cr t* cr a n cr t 

y»^y uyyuy w 


at - 1* accratcra 

y wl^dw^ywWjwV 


aaaccfcggcc 


tatttcccta 


aagggtttat 


3720 


fc a a era a fc a fc a 
wy ciy cio v» ci wy 


tttttcatct 

w ^ w w w w wj w w w 


cage caa tec 


ctaaataaat 


ttcaccagtt 


ttgatttaaa 


3780 


f~*ci\~ rifrcca a fc 
wy i~yy ww>ctetu 


a fc crcr ac a a c t 


fc ci~t.cc[ciccc 

w w w w w>J w w w W 


cgttttcacc 


atgggcaaat 


attatacgea 


3840 


ciy y wy ci i»« ci ciy 


crt - act* a at ac 


cactaacaat 

wy w wyy w 3 ^* w 


tcaggtfccat 


catgccgtct 


qtcrataqctt 


3900 


r*r«a fc crt - caa c 


acraat"crc tfca 


ataaattaca 


acagtactgc 


aataaataac 

3«*v»3«3 "33 


aaqqcaqqqc 


3960 


afc a aa era t* c t 


craa t c c acr c t 

33ClwV>wyyww 


tactaaaagc 


cagataacag 


tatgegtatt 


tgcqcqctga 

v 3 3 3 3 


4 020 


wwwV«i^yw>sjSH w 


ataacraata t 

CL ^dU>4 UWil wCl w 


atactgatat 


gtatacccga 


agtatgtcaa 


aaagaggtgt 


4080 


orfcafQaacrc 
Hv>w c* <->y w»c*. 3 w 


aacatattac 


acrfcerac ao fc fc 


gaeagegaca 


gctatcagtfc 


gctcaaggca 


4140 


fcafcafcaa fcafc 


caat*afcct"cc 


aafcctcrafcaa 


gcacaaccafc 


gcagaatgaa 


qcccqtcqtc 


4200 


■t" cr c a fc ere c cr a 


acartcroaaa 
awyu L»y y cLcio. 


acaaaaaafc c 

y\*33 dc*o»d w w 


aaaaaoacfat 
^yy a 33s 


aactaaaatc 

33 *~ »-3 c *33 ^ w 


geceggttta 


4260 


t* 1~ era a a t" era a 
w uyaaa ^-y cslcu 


caartctttt 


actaacaaaa 

y w vyavvjnya 


acaaaaac t a 

333 w Wj 


gtgaaatgea 


gtttaaggfct 


4320 


fc a ca rr 1" a fc a 


aaaaacracraa 


c cert fcafc cat 


c tat fctataa 
3 3 33 


atgtacagag 


tgatattatt 


4380 


a a c acac c ca 


crcfccracaaafc 


ggfcgatcccc 


ctaaccaata 


cacgtctgct 


gtcagataaa 


4440 


cftctcccQtcr 
y w w- l>v^i wy 3 


aactttaccc 


acr fc oa t a c a fc 

yy wyy wy w» u 


atcaaaaafca 

c * wv 3333 6iW 3 


aaaactaaca 


catgatgacc 


4500 


ac caa tatcrcr 


Cca.atcrtQCC 


aafcetccat fc 


atcaaaaaaa 

w ^- v 3333 uu 3 


aaataactaa 

****wj k>3" 


tctcagccac 


4560 


cacaa a aafc a 

w y w y aauw 3 


acat*caaaaa 


race at fcaac 

w w w wV w ^ wH*^^ 


ctaatattct 


aaaaaatata 

3333 *«*w» W W> W W* 


aatgtcaggc 


4620 


t" c cat* fc at ac 


araaccaotc 


fcacaaatcaa 

L-y wayy wvya 


ccataataac 


tggatatgtt 


gtgttttaca 


4680 


ni™ a fc fc a fccrfc a 

y UAL UdLy L*Cl 


at-ct"atttt* t 
y uuuy wi»www 


fc a fc crpa a a a fc 


rtaat' tt aat* 

\*# ^> CA CL I^p I^p uu O 


atattaatat 


t tat at catt 


4740 


fcfcacafcttct 


cat - 1 eacrc tt 

w^j www www 


fccfcfcafcacaa 

w w w w w CL w t* t*. 


aataattaat 


atccagcaca 


ataacaacca 

3 w 33^33 v - v *3 


4800 


c fc ccracr fc c t a 

w w uy»y www ci 


cracracfccccrc 
3 & y y y w w w y w 


y y w w v«>y o»yy 


taaacctatc 


cctaaccctc 


tcctcggtct 


4860 


cgattctacg 


cgtaccggtt 


agtaatgagt 


ttggaattaa 


ttctgtggaa 


tgtgtgtcag 


4920 


ttagggtgtg 


gaaagtcccc 


aggctcccca 


ggcaggcaga 


agtatgcaaa 


geatgeatet 


4980 


caattagtca 


gcaaccaggt 


gtggaaagtc 


cccaggcfccc 


ccagcaggca 


gaagtatgea 


5040 
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aagcatgcat 


ctcaattagt cagcaaccat 


agtcccgccc 


ctaactccgc 


ccatcccgcc 


5100 


cctaactccg 


cccagttccg cccattctcc 


gccccatggc 


tgactaattt 


tttttattta 


5160 


tgcagaggcc 


gaggccgcct ctgcctctga 


gctattccag 


aagtagtgag 


gaggcttttt 


5220 


tggaggccta 


ggcttttgca aaaagctccc 


gggagcttgt 


atatccattt 


tcggatctga 


5280 


tcagcacgtg 


ttgacaatta atcatcggca 


tagtatatcg 


gcatagtata 


atacgacaag 


5340 


gtgaggaact 


aaaccatggc caagcctttg 


tctcaagaag 


aatccaccct 


cattgaaaga 


5400 


gcaacggcta 


caatcaacag catccccatc 


tctgaagact 


acagcgtcgc 


cagcgcagct 


5460 


ctctctagcg 


acggccgcat cttcactggt 


gtcaatgtat 


atcattttac 


tgggggacct 


5520 


tgtgcagaac 


tcgtggtgct gggcactgct 


gctgctgcgg 


cagctggcaa 


cctgacttgt 


5580 


atcgtcgcga 


tcggaaatga gaacaggggc 


atcttgagcc 


cctgcggacg 


gtgccgacag 


5640 


gtgcttctcg 


atctgcatcc tgggatcaaa 


gccatagtga 


aggacagtga 


tggacagccg 


5700 


acggcagttg 


ggattcgtga attgctgccc 


tctggttatg 


tgtgggaggg 


ctaagcacaa 


5760 


ttcgagctcg 


gtacctttaa gaccaatgac 


ttacaaggca 


gctgtagatc 


ttagccactt 


5820 


tttaaaagaa 


aaggggggac tggaagggct 


aattcactcc 


caacgaagac 


aagatctgct 


5880 


ttttgcttgt 


actgggtctc tctggttaga 


ccagatctga 


gcctgggagc 


tctctggcta 


5940 


actagggaac 


ccactgctta agcctcaata 


aagcttgcct 


tgagtgcttc 


aagtagtgtg 


6000 


tgcccgtctg 


ttgtgtgact ctggtaacta 


gagatccctc 


agaccctttt 


agtcagtgtg 


6060 


gaaaatctct 


agcagtagta gttcatgtca 


tcttattatt 


cagtatttat 


aacttgcaaa 


6120 


gaaatgaata 


tcagagagtg agaggaactt 


gtttattgca 


gcttataatg 


gttacaaata 


6180 


aagcaatagc 


atcacaaatb tcacaaataa 


agcatttttt 


tcactgcatt 


ctagttgtgg 


6240 


tttgtccaaa 


ctcatcaatg " tatcttatca 


tgtctggctc 


tagctatccc 


gcccctaact 


6300 


ccgcccatcc 


cgcccctaac tccgcccagt 


tccgcccatt 


ctccgcccca 


tggctgacta 


6360 


atttttttta 


tttatgcaga ggccgaggcc 


gcctcggcct 


ctgagctatt 


ccagaagtag 


6420 


tgaggaggct 


tttttggagg cctagggacg 


tacccaattc 


gccctatagt 


gagtcgtatt 


6480 


acgcgcgctc 


actggccgtc gttttacaac 


gtcgtgactg 


ggaaaaccct 


ggcgttaccc 


6540 


aacttaatcg 


ccttgcagca catccccctt 


tcgccagctg 


gcgtaatagc 


gaagaggccc 


6600 


gcaccgatcg 


cccttcccaa cagttgcgca 


gcctgaatgg 


cgaatgggac 


gcgccctgta 


6660 


gcggcgcatt 


aagcgcggcg ggtgtggtgg 


ttacgcgcag 


cgtgaccgct 


acacttgcca 


6720 


gcgccctagc 


gcccgctcct ttcgctttct 


tcccttcctt 


tctcgccacg 


ttcgccggct 


6780 


¥■ t- ncccQtca 


acrctcfcaaafc caaaaact.cc 


cttfeagggtt 


ccgatttagt 


qctttacqqc 


6840 


acctcgaccc 


caaaaaactt gattagggtg 


atggttcacg 


tagtgggcca 


tcgccctgat 


6900 
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agacggtttt 

aaactggaac 

cgatttcggc 

acaaaatatt 

tatttgttta 

ataaatgctt 

ccttattccc 

gaaagtaaaa 

caacagcggt 

ttttaaagtt 

cggtcgccgc 

gcatcttacg 

taacactgcg 

tttgcacaac 

agccatacca 

caaactatta 

ggaggcggat 

tgctgataaa 

agatggtaag 

tgaacgaaat 

agaccaagtt 

gatctaggtg 

gttccactga 

tctgcgcgta 

gccggatcaa 

accaaatact 

accgcctaca 

gtcgtgtctt 

ctgaacgggg 

atacctacag 



tcgccctttg 
aacactcaac 
ctattggtta 
aacgcttaca 
tttttctaaa 
caataatatt 
ttttttgcgg 
gatgctgaag 
aagatccttg 
ctgctatgtg 
atacactatt 
gatggcatga 
gccaacttac 
atgggggatc 
aacgacgagc 
actggcgaac 
aaagttgcag 
tctggagccg 
ccctcccgta 
agacagatcg 
tactcatata 
aagatccttt 
gcgtcagacc 
atctgctgct 
gagctaccaa 
gttcttctag 
tacctcgctc 
accgggttgg 
ggttcgtgca 
cgtgagctat 



acgttggagt 

cctatctcgg 

aaaaatgagc 

atttaggtgg 

tacattcaaa 

gaaaaaggaa 

cattttgcct 

atcagttggg 

agagttttcg 

gcgcggtatt 

ctcagaatga 

cagtaagaga 

ttctgacaac 

atgtaactcg 

gtgacaccac 

tacttactct 

gaccacttct 

gtgagcgtgg 

tcgtagttat 

ctgagatagg 

tactttagat 

ttgataatct 

ccgtagaaaa 

tgcaaacaaa 

ctctttttcc 

tgtagccgta 

tgctaatcct 

actcaagacg 

cacagcccag 

gagaaagcgc 



ccacgttctt 

tctattcttt 

tgatttaaca 

cacttttcgg 

tatgtatccg 

gagtatgagt 

tcctgttttt 

tgcacgagtg 

ccccgaagaa 

atcccgtatt 

cttggttgag 

attatgcagt 

gatcggagga 

ccttgatcgt 

gatgcctgta 

agcttcccgg 

gcgctcggcc 

gtctcgcggt 

ctacacgacg 

tgcctcactg 

tgatttaaaa 

catgaccaaa 

gatcaaagga 

aaaaccaccg 

gaaggtaact 

gttaggccac 

gttaccagtg 

atagttaccg 

cttggagcga 

cacgcttccc 



taatagtgga 
tgatttataa 
aaaatttaac 
ggaaatgtgc 
ctcatgagac 
attcaacatt 
gctcacccag 
ggttacatcg 
cgttttccaa 
gacgccgggc 
tactcaccag 
gctgccataa 
ccgaaggagc 
tgggaaccgg 
gcaatggcaa 
caacaattaa 
cttccggctg 
atcattgcag 
gggagtcagg 
attaagcatt 
cttcattttt 
atcccttaac 
tcttcttgag 
ctaccagcgg 
ggcttcagca 
cacttcaaga 
gctgctgcca 
gataaggcgc 
acgacctaca 
gaagggagaa 



ctcttgttcc 

gggattttgc 

gcgaatttta 

gcggaacccc 

aataaccctg 

tccgtgtcgc 

aaacgctggt 

aactggatct 

tgatgagcac 

aagagcaact 

tcacagaaaa 

ccatgagtga 

taaccgcttt 

agctgaatga 

caacgttgcg 

tagactggat 

gctggtttat 

cactggggcc 

caactatgga 

ggtaactgtc 

aatttaaaag 

gtgagttttc 

atcctttttt 

tggtttgttt 

gagcgcagat 

actctgtagc 

gtggcgataa 

agcggtcggg 

ccgaactgag 

aggcggacag 



6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 



gtatccggta 


agcggcaggg 


tcggaacagg agagcgcacg agggagcttc cagggggaaa 


8760 


cgcctggtat 


ctttatagtc 


ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 


8820 


gtgatgctcg 


tcaggggggc 


ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 


8880 


gttcctggcc 


ttttgctggc 


cttttgctca catgttcttt cctgcgttat cccctgattc 


8940 


tgtggataac 


cgtattaccg 


cctttgagtg agctgatacc gctcgccgca gccgaacgac 


9000 


cgagcgcagc 


gagtcagtga 


gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 


9060 


ccccgcgcgt 


tggccgattc 


attaatgcag ctggcacgac aggtttcccg actggaaagc 


9120 


gggcagtgag 


cgcaacgcaa 


ttaatgtgag ttagctcact cattaggcac cccaggcttt 


9180 


acactttatg 


cttccggctc 


gtatgttgtg tggaattgtg agcggataac aatttcacac 


9240 


aggaaacagc 


tatgaccatg 


attacgccaa gcgcgcaatt aaccctcact aaagggaaca 


93 00 


aaagctggag 


ctgcaagctt 




9320 


<210> 6 








<211> 8889 






<212> DNA 








<213> Artificial 






<220> 








<223> pLPl 






<400> 6 
ttggcccatt 


gcatacgttg 


tatccatatc ataatatgta catttatatt ggctcatgtc 


60 


caacattacc 


gccatgttga 


cattgattat tgactagtta ttaatagtaa tcaattacgg 


120 


ggtcattagt 


tcatagccca 


tatatggagt tccgcgttac ataactfcacg gtaaatggcc 


180 


cgcctggctg 


accgcccaac 


gacccccgcc cattgacgtc aataatgacg tatgttccca 


240 


tagtaacgcc 


aatagggact 


ttccattgac gtcaatgggt ggagtattta cggtaaactg 


300 


cccacttggc 


agtacatcaa 


gtgtatcata tgccaagtac gccccctatt gacgtcaatg 


360 


acggtaaatg 


gcccgcctgg 


cattatgccc agtacatgac cttatgggac tttcctactt 


420 


ggcagtacat 


ctacgtatta 


gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 


480 


tcaatgggcg 


fcggatagcgg 


tttgactcac ggggatttcc aagtctccac cccattgacg 


540 


tcaatgggag 


tttgttttgg 


caccaaaatc aacgggactt tccaaaatgt cgtaacaact 


600 


ccgccccatt 


gacgcaaatg 


99 c 99taggc gtgtacggtg ggaggtctat ataagcagag 


660 


ctcgtttagt 


gaaccgtcag 


atcgcctgga gacgccatcc acgctgtttt gacctccata 


720 
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gaagacaccg 


ggaccg accc 




raaaarhtac 


atataatacc 


aaactcciqat 


780 


cctgagaact 


tcagggtgag 


4- r»4- a t-z-TPTPra P 1 




ttcfcttcccc 


ttcttttcta 


840 


tggttaagtt 


catgtcatag 


gaaggggaga 


aguaacaggg 


4- a r<a /~i a 4~ a f" f - 


prarra a a f* pa 
yavvddatva 


900 


gggtaatttt 


gcatttgtaa 


tc ucaaaaaa 


4-rrr"t-4-4-r , t~t - r' 


ht-i*fcaatata 

LUL u- a a l> a w 


cttttttatt 

^ w m V w W ^ V* 


960 


tatcttattt 


ctaatacttt 


ccctaatctc 


ttLCtutcag 


rrrrr* aat*aalrr 
yyuadLaauy 


a t* ar«aat"cft a 


1020 


tcatgcctct 


t tgcaccatit 


ccaaagaata 


atdy uy aUaa 




aaaacaataa 


1080 


caacatuccL 


yCatataaaL 


af f 4- r»4-rrP»a +* 


d uaaa Luy lci 


actcratcrtaa 


aacrcrtttcat 


1140 


attgctaata 


gcagctacaa 


f- <-(yia c% ^a 4~ a /a 
tCCayCLaLL 


a4-4-r , t-rrrf-i-h 

aLLULy^LLL, 


t*af~ t" t tat era 


ttaaaataacr 

i-y y y «. a ay 


1200 


gctggattat 


uccgagccca 


agci-aggccc 




r'a tcrttpata 


cctcttatct 


1260 


tcctcccaca 


gcccctgggc 


aacgt.gct.gg 


u*- L.y uy uy v-» u 


cr cr r 1 c c a t c a c 


tttaacaaaa 

S3 v» aaay 


1320 


cacgtgagat 


ctgaattcga 


gauCugccgc 


cgccatgy y u 


y L-yay ay ^y 


raatattaaa 

^ ay v^. a u- i— a ay 


1380 


cgggggagaa 


4- 4- a r*r a 4~ a 4* 

txagaccgau 


yy 9 dddddd U 




^^dyyyy y a.a 


agaaaaaata 


1440 


uaaau uaaaa 


cacacagca u 


yyy ^dog cciy 


nnafTrt" ansa 


ccrattcacacr 

v» y a Li^viVjv »«*-y 


ttaatcctcrcr 


1500 


cctgttagaa 


acatcagaag 


4~ <ar4~ artapa 
yCtytayaLd 


ad LaL uyyyd 


raarharaac 

ay i— a waa^ 


catcccttca 


1560 


gacaggatca 


gaagaac fc t a 


*^ 4^ a 4~ 4~ af'a 

gaUCatLaLa 


+» a a 4- a a n 4- a 
uaataLdy L.d 


y^aavvvvv u 


attatataca 

a u» u- y y L»y v^a 


1620 


tcaaaggata 


gagataaaag 


acaccaagga 


agctttoyoc 


a a na ♦* a fia Ptpt 
ddyaudydyy 


sacraaraaaa 

aay ay v_ a aaa 


1680 


caaaagtaag 


aaaaaagcac 


agCaayCayL 


a rfftrra r*a r*a 
ay Lya^a^a 


acracacaaca 


atcaaatcaa 


1740 


ccaaaattac 


cccatagcgc 


ar^aa oatrpa 
ag^aCaUCLa 






ccatatcacc 


1800 


cagaacuu ua 


act L-yudty y y 


4- a a a a rrf" anh 
uctctctcty uoy *- 


ay ddy ay day 


actttcacicc 


cagaagtgat 


1860 


acccacyuuc 


tcagcattat 


Cdy dciyy ciy u 




cratthaaaca 


ecatgetaaa 


1920 


cacagtgggg 




r» a rr r» r« a \~ rt r* a 
LayLta tyL>a 


aaf- rjf faaaa 


aaoaccatca 

y ay av»> w a w s»» t* 


a t era eta a acr c 


1980 


tgcagaatgg 


ganagagugc 


a I" rr<3 »~t 4~ rTf' a 

auccoytyca 


tycayyyuu u 


a 1 1 crca rr'acr 

a i_ uy vavway 


accaaataaa 

y ay a v» ^ 


2040 


agaaccaagg 


ggaagtgaca 


4* a /**♦ ^» a r~t/T a a /■* 

tayCayyaoC 


4- a y-i +- a o+" a r» r* 
Lautay tawL> 


pff oarrrraj a n 
w l> l. way y a»w 


aaataociatcf 

a a a l- ay y a i_ y 


2100 


gatgac acat 


aatccaccua 


^" o^T'a «— »4~ a rm 
WCCCaytayy 


dy ddd i>v l. d c 


a a a acta tcicf a 
ddaay a v-y y a 


taatcctaaa 

u. aa v^vwyyy 


2160 


atfcaaataaa 


auaguaagaa 


4" r*r4" a 4- a fr f f C 

Lyuatayotc 


f-anpa rro a 4- 4- 
UaLt ay talt 


f t pro a paHaa 


aacaacraacc 

y aay y a\»« o 


2220 


aaaggaaccc 


uuuagagacc 


acgcagaccg 


p-a 4~ 4~ /~t 4— a 4* a a a 


anhnfaa rfa PT 
dCLLLddydy 


pppranpaanp 
wv*3»yv»ayv 


2280 


ttcacaagag 


gtaaaaaatt 


ggacgacaga 


a a r>« /^+* 4" <a*4~ 4~ <~t 

aaccctyLLy 


ctfcipaa aaf n 
gCCvddddUy 


p* pfa arrpapia 
Lyadvvvay d 


2340 


ttgtaagact 


attttaaaag 


cattgggacc 


aggagcgaca 


ctagaagaaa 


tgatgacagc 


2400 


atgtcaggga 


gtggggggac 


ccggccataa 


agcaagagtt 


ttggctgaag 


caatgageca 


2460 


agtaacaaat 


ccagctacca 


taatgataca 


gaaaggcaat 


tttaggaacc 


aaagaaagac 


2520 
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tgttaagtgt ttcaattgtg gcaaagaagg gcacatagcc aaaaattgca gggcccctag 2580 

gaaaaagggc tgttggaaat gtggaaagga aggacaccaa atgaaagatt gtactgagag 2640 

acaggctaat tttttaggga agate tggcc ttcccacaag ggaaggccag ggaattttct 2700 

tcagagcaga ccagagccaa cagccccacc agaagagagc ttcaggtttg gggaagagac 2760 

aacaactccc tctcagaagc aggagecgat agacaaggaa ctgtatcctt tagcttccct 2820 

cagatcactc tttggcagcg acccctcgtc acaataaaga taggggggca attaaaggaa 2880 

gctctattag atacaggagc agatgataca gtattagaag aaatgaattt gecaggaaga 2940 

tggaaaccaa aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag 3000 

atactcatag aaatctgegg acataaagct ataggtacag tattagtagg acctacacct 3060 

gtcaacataa ttggaagaaa tctgttgact cagattggct gcactttaaa ttttcccatt 3120 

agtcctattg agactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa 3180 

caatggecat tgacagaaga aaaaataaaa gcattagtag aaatttgtac agaaatggaa 3240 

aaggaaggaa aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgee 33 00 

ataaagaaaa aagacagtac taaatggaga aaattagtag atttcagaga acttaataag 33 60 

agaactcaag atttctggga agttcaatta ggaataccac atectgeagg gttaaaacag 3420 

aaaaaatcag taacagtact ggatgtgggc gatgeatatt tttcagttcc cttagataaa 3480 

gacttcagga agtatactgc atttaccata cctagtataa acaatgagac accagggatt 3540 

agatatcagt acaatgtgct tccacaggga tggaaaggat caccagcaat attccagtgt 3600 

agcatgacaa aaatcttaga gecttttaga aaacaaaatc cagacatagt catctatcaa 3660 

tacatggatg atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata 3720 

gaggaactga gacaacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag 3780 

aaagaacctc cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag 3 840 

cctatagtgc tgccagaaaa ggacagctgg actgtcaatg acatacagaa attagtggga 3 900 

aaattgaatt gggcaagtca gatttatgea gggattaaag taaggcaatt atgtaaactt 3960 

cttaggggaa ccaaagcact aacagaagta gtaccactaa cagaagaagc agagctagaa 4020 

ctggcagaaa acagggagat tctaaaagaa ccggtacatg gagtgtatta tgacccatca 4080 

aaagacttaa tagcagaaat acagaagcag gggcaaggee aatggacata tcaaatttat 4140 

caagagecat ttaaaaatct gaaaacagga aagtatgcaa gaatgaaggg tgcccacact 4200 

aatgatgtga aacaattaac agaggcagta caaaaaatag ccacagaaag catagtaata 4260 

tggggaaaga ctcctaaatt taaattaccc atacaaaagg aaacatggga agcatggtgg 4320 

acagagtatt ggcaagccac ctggattcct gagtgggagt ttgtcaatac ccctccctta 43 80 
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or or 4~ ^ o o ^ or 4~ 4™ 

gguaccdguu 


ay ay aaay a a 


ncpst'aatacr 

LOU UCl^j 


aaacaaaaac 


tttctatgta 


4440 


gatggggcag 


ccaauaggga 


aanfaaaffa 
ddCUddat Ua 


yydddayLay 


aatatohaac 

a L>a io i.u»w 


fccracaaaocra 


4500 


agacaaaaag 


fctgtccccct 


aacggacaca 


aoaaa+'oar'Ta 
aCddaLCdyd 


a rra p +* rra f-r4~ 4- 
dy dL uy ay l l 


a raacrraatl" 
a Laay l a a l l> 


4560 


cacctagctt 


tgeaggatte 


gggau uagaa 


guaaacauag 


tpapapaphr 

uy dLdy dL ll 


araa1*at"crpa 
aLaa Ld lvj Ld 


4620 


ttgggaatca 


uucaagcaca 


accagauaag 


a pt4- rra a 4* o an 

ay tyddLCdy 


a p/4- 4* a p/4- pari 
dy u uay uLdy 


hf*aaa1*aaha 

LLaaa i*aa L»a 


4680 


gagcaguuaa 


4~ a a a a a a rrrra 

uaaaaaagga 


aaaarrl'pt'ap 
daddy LL IdL 


u *-yy u»auy yy 


La l Lay LaLa 




4740 


r^*"»a /"r/^r aaaf 1 or 

ggaggaaaug 


a a o a a or 4~ a rra 

aaCaayLdya 

i 


UdddUUyy LL 


a pit* npt"nnaa 
ay uy u uy y dd 


LLdy y aaay l 


arhattttta 

aLLdLLLU>La 


4800 


/"» 4~ /tot a a 4~ a or 

ga uygaa tag 


a f* a a oforoo o a 
ataayyCLLa 


anaaftaapah 
ayaayddLao 


rra rra aafafp 
y ay aaauauu 


dLdy uaauty 


aaoaaraato 

y ay ay l aa Ly 


4860 


s"t 4" or ^ +• 4— 

gcuagugauu 


ffaar'^fa/i/^ 


dLL uy Ldy ua 


pr-aaaanaaa 
y Uddddy ddd 


hanhanppan 
Ldy uayLLoy 


p j t*cr1 _ cra taa a 
LLy Ly dLaad 


4920 


ugucagc uaa 


a a or or or rra a orp 

aaggggaagc 


cdtyLdLyyd 


r* a a rrt- a rra H 
Lady Lay du u 


ai~ a nr* ppa cjp; 
y uayLLLayy 


a a 1" a f" aac a cr 

aaLaLyyLay 


4980 




Lata LLUdy a. 


anna a a a nt" f* 
ayyaaady u u 


a ll l Lyy Lay 


pant - 1* r*at"at 

l Loa ljj w 


aaccaataaa 

ay wwo,y 33 


5040 


a V af" a or a a or 
LatoLayaag 


partaapthaal* 

cayaayLaaL 


ILL ay U ay ay 


a r 1 a nnrt r* a a n 
aLayyy Laay 


aaai/aytaL a 


ft*"!" cci"cfcta 

v_ L» V-» t» w I— w Ci. 


5100 


aaaeuagcag 


/*ra ^ or^ 4" f^tr^r r~% o 

gaagauggee 


agtadaadtd 


rrhapa f* a pa n 
y LdudUdUdy 


a p a a hnrrpafl 
dLda Ly y l ay 


paai*M"P3 pp 

LaaLLLL>aLL 


5160 


q /t4~ ^ r"«4~ a oi a *T 

aguacuacag 


4" 4* a a /** o r~r o 

LLadyyccyc 


cuguuggugg 


y egggyduca 


a p/pa crrr a a f- 4- 
ay Lay y aaL l 


4-ororr«a f*t*PPP 

yy 


5220 


tacaa ucccc 


aaagucaagg 


age aa uagaa 


^■pf'al'naat'a 
UCUdUy ddUd 


day aa u udaa 


p/a aaafhaha 
y aaaa l l a l a 


5280 


ggacagguaa 


gagaucaggc 


4" r*ra apafp^t - 

ugaacauc uu 


aaaapappap 

ddydudy udy 


4" a pa a a 4- orrTP 
LdLdad tyyL 


ayLaLLLaLL 




CdcaaLCcca 


a a a ora a a a <ir\ 

aaayaaddyy 




rt rrrri - a Pa rr4- rr 
y yy uduay uy 


p a rrrrp/p/a aarr 
Ldyyyy aaay 


a a 4- a or 4- a oja o 
aatay uayaL 


5400 


ataaCayCad 


LaydCdLaLd 


a a t* a a a pa a 
ddL Laaayact 


ffapaaaaap 
U UdLdddddL 


aaafhapaaa 
dddLLdLaaa 


aafhraaaat" 
a a l l» ^aaaa ^ 


5460 


4" 4- 4~ o or/~ror 4-" 4— 

u uucggg u u u 


a 4* 4- a i^arrrrrra 

aLtdCdyyya 


LdyLdydydL 


r* pa rri* t* t~ p/p/a 
l Ldy u u Ly y a 


aanriappanp 
ddyydLLdy l 


aaaarhect" c 
aaay l lll ll 


5520 


4- orora a a or pj ^ or 

t*gg aaa ygt-g 


a a rTOfOfrro a or 4- 

ciay yy y cdy u 


anhaafapaa 
dy Lad UdUdd 


rra haafa p/t"rr 
y dLddLdy Ly 


APaf"aaaa r-r4- 
aoduddddy l 


a oj t crp p a a cr a 

ay LyLLaay a 


558O 


a or a aaarrraa 
dy ddddy Ldd 


a rra t* r»a t* r»a fr 
ay aLLalLay 


rrrra 4- f- = +- nna 
y yd u uduyy d 


a a s p a pja 4- rro: 
a ddL ay a Lyy 


Ldy y Ly d Ly a 


LLy Ly Lyyv-a 


5640 


a or 4- a rra p" a pip; 
ay Lay duayy 


a uy ayy auua 


apapal" rrna a 
oLoLdLyydo 


4- 4- pprrrja rrcrr 
l LLLyyay Ly 


a r* p cr p a cr cr a o 
y LLy Lay y ay 


etttattccfc 


5700 


4- or or or 4- f" p t* #" n 

uyggeuL Lug 


yy ay cay cay 


rra anpa pt*a h 
y ddy LdCLdL 


rfrfrtprfparrprr 

yyy Lgudycg 


H pa a t* p/a pp/p 
LLdduydLy l 


4- rra 00/0/+- a 0 a 
Ly awyy LdLd 


5 760 


ggccagacaa 


uuauuguCug 


guauagugca 


gcagcagaac 


a a 4* 4" 4~ rrn 4" or a 

dduu uyc uy a 


01 01 or o> 4- a 4* 4- or a 

ggy cuauuga 


coo n 


ggcgcaacag 


caucugcugc 


aac ucacagu 


c uggggcauc 


a age age ucc 


«-» rrrrrt a a or a a 4" 

aggcaagaau 


C Q O A 


ccuggc ugcg 


y aaagauacc 


LdddygaLca 


^> rt a 4~ /in4* 

acagcucc ug 


nnrrQ 4" 4~ 4~ rrrrrt 

gggacccggg 


r*»4- 4" oro>4~ 0 4~ or or 

guugcucugg 




aaaac tea uu 


ugcaccacug 


frfhrfo 4" 4- or 

eugugee wug 


ora a 4"orp4" arrf* 

gaaugcuagu 


f- or or a or4" a a 4- a 

uggaguaaua 


a a 4- o> 4" 0 4" p/p/a 

daULUL Lyy d 


0 v v yj 


acagatttgg 


aatcacacga 


cctggatgga 


gtgggacaga 


gaaattaaca 


attacacaag 


6060 


cttccgcgga 


attcacccca 


ecagtgeagg 


ctgcctatca 


gaaagtggtg 


gctggtgtgg 


6120 


ctaatgccct 


ggcccacaag 


tatcactaag 


ctcgctttct 


tgctgtccaa 


tttctattaa 


6180 
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aggttccttt gttccctaag tccaactact aaactggggg atattatgaa gggccttgag 6240 

catctggatt ctgcctaata aaaaacattt attttcattg caatgatgta tttaaattat 6300 

ttctgaatat tttactaaaa agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa 6360 

tgaagagcta gttcaaacct tgggaaaata cactatatct taaactccat gaaagaaggt 6420 

gaggctgcaa acagctaatg cacattggca acagcccctg atgcctatgc cttattcatc 6480 

cctcagaaaa ggattcaagt agaggcttga tttggaggtt aaagttttgc tatgctgtat 6540 

tttacattac ttattgtttt agctgtcctc atgaatgtct tttcactacc catttgctta 6600 

tcctgcatct ctcagccttg actccactca gttctcttgc ttagagatac cacctttccc 6660 

ctgaagtgtt ccttccatgt tttacggcga gatggtttct cctcgcctgg ccactcagcc 6720 

ttagttgtct ctgttgtctt atagaggtct acttgaagaa ggaaaaacag ggggcatggt 6780 

ttgactgtcc tgtgagccct tcttccctgc ctcccccact cacagtgacc cggaatccct 6840 

cgacatggca gtctagcact agtgcggccg cagatctgct tcctcgctca ctgactcgct 6900 

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6960 

atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 7020 

caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 7080 

gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 7140 

ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 7200 

cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 7260 

taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 7320 

cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 7380 

acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 7440 

aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 7500 

atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 7560 

atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 7620 

gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 7680 

gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 7740 

ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 7800 

ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 7860 

tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 792 0 

accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 7980 

atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 8040 
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cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 8100 

tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg 8160 

tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 8220 

gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 8280 

agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 8340 

aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 8400 

gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 8460 

tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 8520 

gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 8580 

tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 8640 

aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 8700 

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 8760 

acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacggga tcccctgagg 8820 

gggcccccat gggctagagg atccggcctc ggcctctgca taaataaaaa aaattagtca 8880 
gccatgagc 



8889 



<210> 7 

<211> 4180 

<212> DNA 

<213> Artificial 

<220> 

<223> pLP2 
<400> 7 

aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca 60 

tgccttacaa ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga 120 

tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt 180 

ccgcattgca gagatattgt atttaagtgc ctagctcgat acaataaacg ccatttgacc 240 

attcaccaca ttggtgtgca cctccaagct cgagctcgtt tagtgaaccg tcagatcgcc 300 

tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 360 

ccctcgaagc tagtcgatta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 420 
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gacctcctca aggcagtcag actcatcaag tttctctatc aaagcaaccc acctcccaat 480 

cccgagggga cccgacaggc ccgaaggaat agaagaagaa ggtggagaga gagacagaga 540 

cagatccatt cgattagtga acggatcctt agcacttatc tgggacgatc tgcggagcct 600 

gtgcctcttc agctaccacc gcttgagaga cttactcttg attgtaacga ggattgtgga 660 

acttctggga cgcagggggt gggaagccct caaatattgg tggaatctcc tacaatattg 720 

gagtcaggag ctaaagaata gtgctgttag cttgctcaat gccacagcta tagcagtagc 780 

tgaggggaca gatagggtta tagaagtagt acaagaagct tggcactggc cgtcgtttta 840 

caacgtcgtg atctgagcct gggagatctc tggctaacta gggaacccac tgcttaagcc 900 

tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 960 

taactagaga tcaggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 1020 

ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 1080 

gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 1140. 

tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 1200 

ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 1260 

tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 1320 

aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 1380 

acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 1440 

tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 1500 

caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 1560 

gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 1620 

tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 1680 

ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 1740 

ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 1800 

accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 1860 

gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 1920 

tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 1980 

ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 204 0 

ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 2100 

gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 2160 

caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 2220 

ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 2280 
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cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 234 0 

gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 2400 

taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 2460 

tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 2520 

agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 2580 

caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 2640 

ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 2700 

tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 2760 

agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 2 820 

tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 2880 

agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 2940 

gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 3000 

gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 3060 

tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 312 0 

gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 3180 

accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 3240 

accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 3300 

gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 3 360 

ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 3420 

atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 3480 

gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 3540 

cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 3600 

gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 3660 

gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 3720 

tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 3780 

cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 3 84 0 

ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 390 0 

gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 3 960 

acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 402 0 

aggaaacagc tatgacatga ttacgaattc gatgtacggg ccagatatac gcgtatctga 4080 
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ggggactagg gtgtgtttag gcgaaaagcg gggcttcggt tgtacgcggt taggagtccc 
ctcaggatat agtagtttcg cttttgcata gggaggggga 

<210> 8 

<211> 5821 

<212> DNA 

<213> Artificial 



<220> 

<223> pLP/VSVG 
<400> 8 

ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc 60 

caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 120 

ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 180 

cgcctggctg accgcccaac gacccccgcc' cattgacgtc aataatgacg tatgttccca 240 

tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 300 

cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 360 

acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 420 

ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 4 80 

tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 540 

tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 600 

ccgccccatt gacgcaaatg ggcggtaggc gtgtaeggtg ggaggtctat ataagcagag 660 

ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata 72 0 

gaagacaccg ggaccgatcc agcctcccct cgaagcttac atgtggtacc gagctcggat 780 

cctgagaact tcagggtgag tctatgggac ccttgatgtt ttctttcccc ttcttttcta 840 

tggttaagtt catgtcatag gaaggggaga agtaacaggg tacacatatt gaccaaatca 900 

gggtaatttt gcatttgtaa ttttaaaaaa tgctttcttc ttttaatata cttttttgtt 960 

tatcttattt ctaatacttt ccctaatctc tttctttcag ggcaataatg atacaatgta 1020 

tcatgcctct ttgcaccatt ctaaagaata acagtgataa tttctgggtt aaggcaatag 1080 

caatatttct gcatataaat atttctgcat ataaattgta actgatgtaa gaggtttcat 1140 

attgctaata gcagctacaa tccagctacc attctgcttt tattttatgg ttgggataag 1200 

gctggattat tctgagtcca agctaggccc ttttgctaat catgttcata cctcttatct 1260 
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tcctcccaca gctcctgggc 


aacgtgctgg 


tctgtgtgct 


gy CCCaLLaU 


4- fhrraraaan 
u L- uyy Laaay 


X O w 
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gLgcctct ug 
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aagctcagat 
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a 4- a a (rra r*f* t" 
d. UctclLy aL. U U 


J. 3 \J U 


aataggcaca gccttacaag 


tcaaaatgcc 


caagagtcac 


aaggctautc 


annanarrtn 
dayCaydLyy 




ttggatgtgt catgcttcca 


aatgggtcac 


tacttgcgat 


utccgctggt 


a t~ rrrra pprra a 




gtatataaca cattccatcc 


gatccttcac 


cccatcxgta 


rraapsa +- ft f* a 
yaaCaatyCa 


a nrra a a rrr 1 a t~ 
aggaadyuaL 


X O O V 


tgaacaaacg aaacaaggaa 


cttggctgaa 


tccaggcttc 


CCUCCLCaaa 


i~f 4- pff ftffz\ 4" a 

gcuguggaLa 


1 74-0 

X / *± \i 


tgcaactgtg acggatgccg 


aagcagtgat 


tgtccaggtg 


actcctcacc 


4~ ^*r4"" ^1 4* ft ft 4- 

augugceggt 


X OU v 


tgatgaatac acaggagaat 


gggttgattc 


acagttcatc 


aacggaaaat 


/-t fi ^ ft a a; 4- 4~ a 

gcagcaaLLa 


X o o u 


catatgcccc actgtccata 


actctacaac 


ctggcattct 


gactauaagy 


Luaaayyy^ l. 


1920 


atgtgattct aacctcattt 


ccatggacat 




4- <-« a p;aj r tfis f ft 

tcag aygacy 


y cty ciy ttatv. 


1980 


atccctggga aaggagggca 


cagggttcag 


aagtaactac 


ct ugcutai-g 


a a a p t~ ft ft Si ft ft 

dadccgydyy 




caaggcctgc aaaatgcaat 


actgcaagca 


ttggggagtc 


agacucccau 


coggcgcLtg 




gttcgagatg gctgataagg 


atctctttgc 


tgcagccaga 


ttccctgaat 


fm n f^ *^ «a a ft ft 

geccagaagg 


9 1 60 


gtcaagtatc tctgctccat 


ctcagacctc 


agtggatgta 


agtctaattc 


aggaege cga 


9 99 n 


gaggatcttg gattattccc 


tctgccaaga 


aacctggagc 


aaaatcagag 


cggguccccc 


9 9 po 

Z Z O v 


aatctctcca gtggatctca 


gctatcttgc 


tcctaaaaac 


/-i ft ft a a f* f*ft 

ccayyaaccy 


y LLuLyL' L. U L. 


2340 


caccataatc aatggtaccc 


taaaatactt 


tgagaccaga 


^5^iaf na /™*a ft 

cacatcagag 


f> prra t- a ht*nr 
LU^alat tyu 


2400 


tgctccaatc ctctcaagaa 


tggtcggaat 


gatcagtgga 


actaccacag 


ff» ra ft ft ft ^ a /~i 4- 

aaagggaac l 




gtgggatgac tgggcaccat 


atgaagacgt 


ggaaatttgga 


cccaacggag 


4" +* f* f* rra rrrfa p 

tuctgaggac 


9 590 


cagttcagga tataagtttc 


ctttatacat 


gattggacat 


jr-v -L- ~» 4— rt 4— f ft ft 

ggtaegtegg 


actccgotct 


9 Rfi 0 


tcatcttagc tcaaaggctc 


aggtgttcga 


acatcctcac 


attcaagacg 


j-i f- i-t f +- f ft f a 

ctgcttcgca 


9fi40 


acttcctgat gatgagagtt 


tattttttgg 


tgatactggg 


ctatccaaaa 




9 700 


gcttgtagaa ggttggttca 


gtagttggaa 


aagctctatt 


gectcttttt 


i.Mi>f f - f Mgf 


z /ou 


agggttaatc attggactat 


tcttggttct 


ccgagttggt 


atccatcttc 


gcactaaatt 


Z oZ U 


aaagcacacc aagaaaagac 


agatttatac 


agacatagag 


atgaaccgac 


ttggaaagta 


Z ooU 


actcaaatcc tgcacaacag 


attcttcatg 


tttggaccaa 


atcaacttgt 


gataccatgc 


2940 


tcaaagaggc ctcaattata 


tttgagtttt 


taatttttat 


gaaaaaaaaa 


aaaaaaaacg 


3000 


gaattcaccc caccagtgca 


ggctgcctat 


cagaaagtgg 


tggctggtgt 


ggctaatgee 


3060 
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ctggcccaca agtatcacta agctcgcttt cttgctgtcc aatttctatt aaaggttcct 3120 

ttgttcccta agtccaacta ctaaactggg ggatattatg aagggccttg agcatctgga 3180 

ttctgcctaa taaaaaacat ttattttcat tgcaatgatg tatttaaatt atttctgaat 3240 

attttactaa aaagggaatg tgggaggtca gtgcatttaa aacataaaga aatgaagagc 33 00 

tagttcaaac cttgggaaaa tacactatat cttaaactcc atgaaagaag gtgaggctgc 3360 

aaacagctaa tgcacattgg caacagcccc tgatgcctat gccttattca tccctcagaa 3420 

aaggattcaa gtagaggctt gatttggagg ttaaagtttt gctatgctgt attttacatt 3480 

acttattgtt ttagctgtcc tcatgaatgt cttttcacta cccatttgct tatcctgcat 3540 

ctctcagcct tgactccact cagttctctt gcttagagat accacctttc ccctgaagtg 3600 

ttccttccat gttttacggc gagatggttt ctcctcgcct ggccactcag ccttagttgt 3660 

ctctgttgtc ttatagaggt ctacttgaag aaggaaaaac agggggcatg gtttgactgt 3720 

cctgtgagcc cttcttccct gcctccccca ctcacagtga cccggaatcc ctcgacatgg 3780 

cagtctagca ctagtgcggc cgcagatctg cttcctcgct cactgactcg ctgcgctcgg 3840 

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 3900 

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 3960 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 4020 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 4 080 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 4140 

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 4200 

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 4260 

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 4320 

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 4380 

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta 4440 

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 4500 

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 4560 

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 4620 

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 4680 

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 4740 

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 4800 

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 4860 

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 4920 
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taaaccagcc 


agccggaagg 


gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 


49oU 


tccagtctat 


taattgttgc 


cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 


C A A A 

5U4U 


gcaacgttgt 


tgccattgct 


acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 


E T A A 


cattcagctc 


cggttcccaa 


cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 


bloU 


aagcggttag 


ctccttcggt 


cctccgatcg ttgtcagaag taagttggcc gcagtgtxat. 


C O O A 


cactcatggt 


tatggcagca 


ctgcataatt ctcttactgt catgccatcc gtaagatgct 


c o q a 


tttctgtgac 


tggtgagtac 


tcaaccaagt cattctgaga atagtgtatg cggcgaccga 


C "3 /I A 


gttgctcttg 


cccggcgtca 


atacgggata ataccgcgcc acatagcaga actttaaaag 


54 UU 


tgctcatcat 


tggaaaacgt 


tcttcggggc gaaaactctc aaggatctta ccgctgttga 


c a c<\ 
54oU 


gatccagttc 


gatgtaaccc 


actcgtgcac ccaactgatc ttcagcatct tttactttca 


C CO A 


ccagcgtttc 


tgggtgagca 


aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 


CCD A 


cgacacggaa 


atgttgaata 


ctcatactct tcctttttca atattattga agcatttatc 


C £ A A 

564U 


agggttattg 


tctcatgagc 


ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 


5700 


gggttccgcg 


cacatttccc 


cgaaaagtgc cacctgacgg gatcccctga gggggccccc 


5760 


atgggctaga 


ggatccggcc 


tcggcctctg cataaataaa aaaaattagt cagccatgag 


5820 


c 






5821 



<210> 9 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> lamin A/C control oligo 

<400> 9 

caccgtgttc ttctggaagt ccagcgaact ggacttccag aagaaca 47 

<210> 10 

<211> 47 

<212> DNA 

<213> Artificial 
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<220> 

<223> lamin A/C control oligo 

<400> 10 

aaaatgttct tctggaagtc cagttcgctg gacttccaga agaacac 47 

<210> 11 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Sequencing primer 

<400> 11 

ggactatcat atgcttaccg 20 

<210> 12 

<211> 17 

<212> DNA 

<213> Artificial 



<220> 

<223> Sequencing primer 

<400> 12 

caggaaacag ctatgac 17 

<210> 13 

<211> 20 

<212> DNA 

<213> Artificial 



<22 0> 
<223> 
<400> 



U6 promoter sequence 
13 
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aaggtcgggc aggaagaggg 20 

<210> 14 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> U6 promoter sequence 
<400> 14 

agcgagcacg gtgtttcgtc 20 

<210> 15 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> U6 promoter sequence with Asp718 and Not I at 5 1 end 
<400> 15 

gtgggtacca aggtcgggca ggaagaggg 29 

<210> 16 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> U6 promoter sequence with Asp718 and Not I at 5 * end 
<400> 16 

gtggcggccg cggtgtttcg tcctttccac aag 33 

<210> 17 
<211> 44 
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<212> DNA 

<213> Artificial 

<220> 

<223> primer for ccdB gene 
<400> 17 

gtggcggccg caaagatcct ccagtggatc cggcttacta aaag 44 

<210> 18 

<211> 57 

<212> DNA 

<213> Artificial 



<220> 

<223> primer for ccdB gene 

<400> 18 

gtgctcgaga aaaaagtcga cacggagccc tccagttata ttccccagaa catcagg 57 

<210> 19 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> part of double stranded oligo containing Bsal and Not I site to en 
gineer Bsal vecto 

<400> 19 

gagaccgcgg ccgcttctcg aggtctcatt 30 



<210> 20 

<211> 30 

<212> DNA 

<213> Artificial 
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<220> 

<223> part of double stranded oligo containing Bsal and Not I site to en 
gineer Bsal vecto 

<400> 20 

tgagacctcg agaagcggcc gcggtctccg 3 0 

<210> 21 

<211> 31 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer for new ccdB region with NotI site, Bsal site, Xbal site 

<400> 21 

cacgcggccg ctggatccgg cttactaaaa g 31 

<210> 22 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for new ccdB region 

<400> 22 

cactctagaa aaaatgagac cttatattcc ccagaacatc agg 43 

<210> 23 

<211> 7 

<212> PRT 

<213> Artificial 



<220> 
<223> 
<220> 



Consensus cleavage site 
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<221> UNSURE 
<222> (2).. (3) 

<223> Xaa can be any amino acid 



<220> 

<221> UNSURE 

<222> (5) . . <5) 

<223> Xaa can be any amino acid. 



<220> 

<221> UNSURE 

<222> (7) . . (7) 

<223> Xaa can be any amino acid except Proline. 



<400> 23 

Glu Xaa Xaa Tyr Xaa Gin Xaa 
1 5 

<210> 24 

<211> 7 

<212> PRT 

<213> Artificial 



<220> 

<223> TEV1 cleavage site 
<220> 

<221> UNSURE 

<222> (7).. (7) 

<223> Xaa can be any amino acid except Proline. 



<400> 24 



Glu Asn Leu Tyr Phe Gin Xaa 
1 5 
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<210> 25 

<211> 7 

<212> PRT 

<213> Artificial 



<220> 

<223> TEV 2 cleavage site 
<220> 

<221> UNSURE 

<222> (7) . . (7) 

<223> Xaa can be any amino acid except Proline. 



<400> 25 

Glu Thr Leu Tyr He Gin Xaa 
1 5 

<210> 26 

<2il> 5 

<212> PRX 

<213> Artificial 



<220> 

<223> EK Cleavage site 

<400> 26 

Asp Asp Asp Asp Lys 
1 5 

<210> 27 

<211> 7 

<212> PRT 

<213> Artificial 

<220> 
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<223> TEV1 cleavage site (cleaves between Gin and Gly) 

<400> 27 

Glu Asn Leu Tyr Phe Gin Gly 
1 5 

<210> 28 

<211> 3 

<212> PRT 

<213> Artificial 

<220> 

<223> ulpl protease recognition site 

<400> 28 

Gly Gly Ser 
1 

<210> 29 

<211> 5 

<212> PRT 

<213> Artificial 

<220> 

<223> Peptide tag 

<400> 29 

Phe His His Thr Thr 
1 5 

<210> 30 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> FlAsH tags 
<220> 
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<221> UNSURE 
<222> (3).. (3) 

<223> Xaa can be any amino acid. In many instances, Xaa is an amino ac 
id with high a-helical propensity 

<220> 

<221> UNSURE 
<222> (4) . . (4) 

<223> Xaa can be any amino acid. In many instances, Xaa is an amino ac 
id with high a-helical propensity 

<400> 30 

Cys Cys Xaa Xaa Cys Cys 
1 5 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial 



<220> 

<223> Bsal digestion site 
<400> 31 

acaccggaga ccggtctcat tttttttcta ga 
<210> 32 



32 



<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Bsal digestion site complementary sequence 

<400> 32 

gctagaaaaa aaatgagacc ggtctccggt gt 
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<210> 33 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> Bsal digestion fragment 

<400> 33 

ttttttttct aga 13 

<210> 34 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 

<223> possible insert with overhang into pENTR/U6-BsaI-ccdB 
<220> 

<221> Unsure 

<222> (6).. (24) 

<223> N can be any nucleotide. 

<400> 34 

caccgnnnnn nnnnnnnnnn nnnn 24 

<210> 35 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 



<223> possible insert (complementary seq. with overhang) into pENTR/U6- 
Bsal-ccd 
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<220> 

<221> Unsure 

<222> (5).. (23) 

<223> N can be any nucleotide. 

<400> 35 

aaaannnnnn nnnnnnnnnn nnnc 

<210> 36 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> siRNA core molecule with overhang 

<400> 36 

uucagugagu agagucauat t 

<210> 37 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> BiRNA core molecule complementary seq. with overhang 

<400> 37 

uaugactcta ctcacugaat t 

<210> 38 

<211> 34 

<212> RNA 

<213> Artificial 
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<220> 

<223> miRNA sequence with mismatches 
<400> 38 

gcgacuguaa acauccucga cuggaagcug ugaa 

<210> 39 

<211> 37 

<212> RNA 

<213> Artificial 

<220> 

<223> miRNA complementary sequence with mismatches 
<400> 39 

gccacagaug ggcuuucagu cgguaguuug cagcugc 

<210> 40 

<211> 24 

<212> RNA 

<213> Artificial 

<220> 

<223> diced miRNA with mismatches 
<400> 40 

uguaaacauc cucgacugga agcu 

<210> 41 

<211> 22 

<212> RNA 

<213> Artificial 

<220> 

<223> diced miRNA complementary sequence with mismatches 
<400> 41 

cuuucagucg gauguuugca gc 



<210> 42 

<211> 23 

<212>. RNA 

<213> Artificial 

<220> 

<223> siRNA 

<400> 42 

auuucgaagu auuccgcgua cgu 

<210> 43 

<211> 23 

<212> RNA 

<213> Artificial 

<220> 

<223> siRNA complementary sequence 

<400> 43 

acguacgcgg aauacuucga aau 

<210> 44 

<211> 25 

<212> RNA 

<213> Artificial 

<220> 

<223> shRNA 

<400> 44 

auuucgaagu auuccgcgua cguuu 

<210> 45 

<211> 28 

<212> RNA 
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<213> Artificial 



<220> 

<223> shRNA complementary sequence 

<400> 45 

cgacguacgc ggaauacuuc gaaauuuu 

<210> 46 

<211> 41 

<212> RNA 

<213> Artificial 

<220> 

<223> miRNA with mismatches 

<400> 46 

cucgagaucu gcgccguacg cggaauacuu cgaaauguga a 

<210> 47 

<211> 47 

<212> RNA 

<213> Artificial 

<220> 

<223> miRNA complementary sequence with mismatches 

<400> 47 

gccacagaug auuucgaagu auuccgcgua cguugcggau ccucgag 

<210> 48 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Directional TOPO cloning site with gene of interest 
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<220> 

<221> Unsure 

<222> (13).. (18) 

<223> N can be any nucleotide. 



<400> 48 

cccttcacca tgnnnnnnaa ggg 23 

<210> 49 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Complementary sequence of Directional TOPO cloning site with gene 
of interes 

<220> 

<221> Unsure 

<222> (6).. (11) 

<223> N can be any nucleotide. 



<400> 49 

cccttnnnnn ncatggtggg tgaaggg 27 



<210> 50 

<211> 11 

<212> DNA 

<213> Artificial 



<220> 

<223> TOPO cloning site 



<400> 50 
cccttaaggg c 



11 
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<210> 51 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> TOPO cloning site complementary sequence with overhang 

<400> 51 
gcccttggtg aaggg 



